Gene Gdia_3031 details


Gene Information

Locus tag: Gdia_3031
Symbol: carB
ID: 6976465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3316189
End bp: 3319443
Gene Length: 3255 bp
Protein Length: 1084 aa
Translation table: 11
GC content: 68%
IMG OID: 643392539
Product: carbamoyl phosphate synthase large subunit
Protein accession: YP_002277376
Protein GI: 209545147
COG category: [E] Amino acid transport and metabolism
    [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
    [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
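
The gene and protein lengths in this record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the terminal stop codon is not counted in the protein length (the function name `cds_lengths` is illustrative and not part of the database):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(3316189, 3319443)
print(gene_len, protein_len)  # 3255 1084
```

Both values agree with the Gene Length and Protein Length fields in this record.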


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.144026
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0049366
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCCCAAAC GGACAGATAT CCGCTCCATC CTGATCATCG GCGCTGGTCC GATCGTCATC 
GGCCAGGCGT GCGAATTCGA CTATTCCGGC GCCCAGGCCT GCAAGGCCCT GCGCGAGGAA
GGGTACCGGG TCATCCTGGT CAATTCCAAC CCGGCCACGA TCATGACCGA TCCCGGACTG
GCGGACGCGA CCTATGTCGA GCCGATCACC CCGGAATTCG TCGAGCGGAT CATCCTGCGC
GAAAAGCCGG ACGCCATCCT GCCGACCATG GGCGGCCAGA CGGCGCTGAA CACCGCCATG
GCGCTGGACA AGTCCGGCTT CCTGAAGGAA CACGGGGTGG AGTTGATCGG CGCTGACGCC
GAAGTCATCG ACCGTGCCGA GGACCGGCAG AAATTCCGCG AGGCGATGGA CGCGATCGGC
ATCGAAAGCC CGCGCAGCGT GATCGCCCAT ACGCTGGACG AGGCCCGCGC GGCGCTGGAG
CAGGTCGGCC TGCCCGCGGT CATCCGTCCC TCGTTCACCA TGGGCGGCTC GGGCGGCGGC
ATCGCCTATA ACCGCGAGGA ATTCGACCAG ATCGTGGCCT CGGGCCTCGA CGCGTCCCCG
ACCACCGAGG TGCTGATCGA GGAATCGGTG CTGGGCTGGA AGGAGTTCGA GATGGAGGTC
GTCCGCGACA GCGCGGACAA CTGCATCATC GTGTGCTCGA TCGAGAACAT CGATCCGATG
GGCGTGCATA CCGGCGATTC CATCACCGTC GCGCCGGCGC TGACCCTGAC CGACAAGGAA
TACCAGCGCA TGCGCGACGC CTCGCTGGCG TGCCTGCGCG CCATCGGCGT CGATACCGGC
GGGTCGAACG TGCAGTTCGG CGTCAACCCC GCCGACGGCC GCATGGTGGT CATCGAGATG
AATCCCCGCG TCTCGCGCTC CTCCGCGCTG GCGTCCAAGG CCACGGGCTT TCCCATCGCC
AAGGTCGCGG CCAAGCTGGC CGTGGGCTAC ACGCTGGACG AACTGGCCAA CGACATCACG
GGCTCGACCC CGGCGTCGTT CGAGCCGACC ATCGACTATG TGGTGGTCAA GATCCCGCGC
TTCACCTTCG AGAAATTCCC CGGCACCCCC GCCCTGCTGT CCACCAGCAT GAAATCGGTC
GGCGAGGCGA TGGCCATCGG CCGCTCGTTC CCCGAGGCGC TGCAGAAGGG CCTGCGCTCG
ATGGAAACCG GGCTGGCCGG GCTGGACCCG GTCGAGGCCC CGGGCGACGG CGGCGAGGAC
GCGTTCCGCG CCGCCCTGTC CCAGCCCCGG CCGGAACGGA TCCTGATGGC CGCCCAGGCG
TTGCGCGCCG GCCTGGGCGT GGATGAAATC CACGCCGCCT GCCGGTTCGA GCCCTGGTTC
CTGCGCGAGT TGCAGAAGAT CGTGGCGGCG GAACATGCCG TGGTCCGCGA CGGCCTGCCC
CAGGACGCGC TGGCGCTGCG CCGGCTGAAG GCGCTGGGCT TCTCGGACGT GCAGCTCGGC
CGCCTGTCGG GCACCGGCGC GCACGAGGTC GCGTCCCTGC GCGCGCGGCT GGCGGTCGCG
CCCGTCTACA AGCGGATCGA CACCTGCGCC GGCGAATTCG CCTCGGCCAC GCCCTATATG
TATTCGACCT ACGAGGGCGG GTTCGGCGTG CCGGATTGCG AAAGCCACCC GACCGACCGG
CGCAAGATCG TGATCCTGGG CGGCGGTCCC AACCGCATCG GCCAGGGGAT CGAATTCGAC
TATTGCTGCG TCCACGCCGC CTATGCGCTG CGCGAGGCCG GGTTCGAGAC CATCATGGTC
AACTGCAACC CCGAGACCGT CTCGACCGAC TACGACACCT CGGACCGCCT GTATTTCGAG
CCGCTGACCG AAGAGGACGT GATCGCGCTG ATCCGGCGCG AGCAGGCCAG CGGCACCGTG
CTGGGCTGCA TCGTGCAGTA TGGCGGCCAG ACGCCGCTGA AGCTGTCCCG CGCGCTGGAG
GCCGCCGGCA TTCCGCTGCT GGGCACGCCG GCCGACGCCA TCGACCGCGC CGAGGACCGC
GAGCGGTTCC AGGCCATGCT GCACAAGCTG GGCCTGCGCC AGCCGGACAA CGGCATCGCC
CGCACGGCGG CCGAGGCCGA GGACGTGGCC GAGCGCATCG GCTACCCCGT CGTGATCCGC
CCGTCCTACG TGCTGGGCGG CCGGGCGATG GAAATCGTCC ATGACCGCGC CAGCCTGCAG
CGCTACATGC GCGTGGCGCT GCAACTGGCC GGGCACGACA TCGCCTCGGG CCCGGTGCTG
ATCGACCGCT ACCTGAACGA CGCGATCGAG GCCGATGTGG ACTGCATTTC CGACGGCCAC
ACCGTCTATG TCGCCGGCGT CATGGAACAT ATCGAGGAAG CGGGCATCCA TTCCGGCGAC
AGCGCCTGCT CGCTGCCGCC CTACACCCTC TCGCCCGCCA TCGTCACCGA ACTGAAGGAA
CAGACCGAGG CGATGGCCCG CGAACTGGGC ATCGTCGGGC TGATGAACGT CCAGTACGCC
ATCAAGGACC AGGACATCTT CGTCCTGGAG GTCAATCCCC GCGCCTCGCG CACCGTGCCC
TTCGTCGCCA AGGCGACCGG CGTGCCGGTG GCCAAGATCG GCGCGCGGGT CATGGCCGGG
GCACGGCTGT CGGAATTCCG GCTGGACGAC CGCGCCGTGG CGCCGCACGT CGCGGTGAAG
GAAGCGGTGT TCCCCTTCAA CCGCTTCCCC AACGTGGACA CGATCCTGGG CCCGGAAATG
CGCTCCACCG GCGAGGTGAT GGGCCTGGAC GCGTCGTTCG AGCGGGCCTT CGCCAAGTCG
CAGCTTGCCG CCGGGGTGCG GCTGCCGCTG TCGGGTGTGG TGTTCCTGTC GGTGCGCCGC
AGCGACAAGG CCGCCATCCC GGCGCTGGCC CGCCGGCTGG TGGACATGGG CTTCACCATC
CTGGCCACCC GGGGCACCGC GCAGCACCTG CGCGACGCCG GGATCGCGGT CGAAGTGGTG
AACAAGGTGC TGGAGGGACG GCCCAACTGC GTCGATGCCA TCCGATCGGG CGACGTGCAG
ATGATCATCA ACACCGCGCA GGGCGCCCAG TCGGTGACGG ACAGCTTCGA CATCCGCCGT
TCGGCGCTGA CAACCGGCAT TCCGCACTTC ACCACCATCG CGGGCGCGCG GGCCGCGACC
CACGCAATCG CGGCAATGCG GGAAGGACCG CTTGAAGTCG CGCCCCTTCA ATCCTACTTT
AGCGGATCGT TCTGA
 
Protein sequence
MPKRTDIRSI LIIGAGPIVI GQACEFDYSG AQACKALREE GYRVILVNSN PATIMTDPGL 
ADATYVEPIT PEFVERIILR EKPDAILPTM GGQTALNTAM ALDKSGFLKE HGVELIGADA
EVIDRAEDRQ KFREAMDAIG IESPRSVIAH TLDEARAALE QVGLPAVIRP SFTMGGSGGG
IAYNREEFDQ IVASGLDASP TTEVLIEESV LGWKEFEMEV VRDSADNCII VCSIENIDPM
GVHTGDSITV APALTLTDKE YQRMRDASLA CLRAIGVDTG GSNVQFGVNP ADGRMVVIEM
NPRVSRSSAL ASKATGFPIA KVAAKLAVGY TLDELANDIT GSTPASFEPT IDYVVVKIPR
FTFEKFPGTP ALLSTSMKSV GEAMAIGRSF PEALQKGLRS METGLAGLDP VEAPGDGGED
AFRAALSQPR PERILMAAQA LRAGLGVDEI HAACRFEPWF LRELQKIVAA EHAVVRDGLP
QDALALRRLK ALGFSDVQLG RLSGTGAHEV ASLRARLAVA PVYKRIDTCA GEFASATPYM
YSTYEGGFGV PDCESHPTDR RKIVILGGGP NRIGQGIEFD YCCVHAAYAL REAGFETIMV
NCNPETVSTD YDTSDRLYFE PLTEEDVIAL IRREQASGTV LGCIVQYGGQ TPLKLSRALE
AAGIPLLGTP ADAIDRAEDR ERFQAMLHKL GLRQPDNGIA RTAAEAEDVA ERIGYPVVIR
PSYVLGGRAM EIVHDRASLQ RYMRVALQLA GHDIASGPVL IDRYLNDAIE ADVDCISDGH
TVYVAGVMEH IEEAGIHSGD SACSLPPYTL SPAIVTELKE QTEAMARELG IVGLMNVQYA
IKDQDIFVLE VNPRASRTVP FVAKATGVPV AKIGARVMAG ARLSEFRLDD RAVAPHVAVK
EAVFPFNRFP NVDTILGPEM RSTGEVMGLD ASFERAFAKS QLAAGVRLPL SGVVFLSVRR
SDKAAIPALA RRLVDMGFTI LATRGTAQHL RDAGIAVEVV NKVLEGRPNC VDAIRSGDVQ
MIINTAQGAQ SVTDSFDIRR SALTTGIPHF TTIAGARAAT HAIAAMREGP LEVAPLQSYF
SGSF
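
The two sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the gene into protein. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction` and `translate` are illustrative assumptions, not part of any database tool.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order.
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from this record.
first_bases = "ATGCCCAAAC GGACAGATAT CCGCTCCATC"
print(translate(first_bases))  # MPKRTDIRSI
```

The printed residues match the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.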