Gene Cagg_2456 details

Gene Information

Locus tag: Cagg_2456
Symbol:
ID: 7266180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2976440
End bp: 2979730
Gene Length: 3291 bp
Protein Length: 1096 aa
Translation table: 11
GC content: 60%
IMG OID: 643567283
Product: carbamoyl-phosphate synthase, large subunit
Protein accession: YP_002463765
Protein GI: 219849332
COG category: [E] Amino acid transport and metabolism
              [F] Nucleotide transport and metabolism
COG ID: [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ)
TIGRFAM ID: [TIGR01369] carbamoyl-phosphate synthase, large subunit
            [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
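
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions: that Start bp and End bp are 1-based inclusive positions, and that the gene length counts the stop codon (the gene sequence in this record does end in TAG). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2976440
END_BP = 2979730
GENE_LENGTH_BP = 3291
PROTEIN_LENGTH_AA = 1096


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: 2979730 - 2976440 + 1 = 3291 bp, and 3291 / 3 = 1097 codons, one of which is the stop codon, leaving 1096 residues.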


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCAAAGC GCACCGATCT CCATACCATT CTGATTATCG GCTCCGGCCC AATCGTGATC 
GGCCAAGCCT GTGAATTCGA CTATTCCGGC ACCCAAGCGT GCAAGGCGTT GCGTGAAGAG
GGCTATCGGG TCGTGCTCGT CAATTCAAAC CCGGCCACCA TCATGACCGA TCCCGGCCTC
GCCGACGCCA CCTACATCGA ACCACTGACC GTCCCCAGTC TCGAACGCAT TATTGCCCGT
GAGCGCCCTG ACGCGCTTTT ACCGACCGTC GGCGGCCAGA CTGCGCTCAA TTTGGCGGTA
GCGCTCCACG AGGCCGGTAT CCTCGACAAG TACGGTGTCG AGTTGATCGG CGCATCGGTT
GAGGCGGTGC GGATCGCCGA GGATCGCCAA CGCTTTAAAG ATAAGATGAT CGAGATTGGG
TTGCAGGTAC CTCGCTCCGG CACCGCAACT ACCCTCGATG AAGCGTTGGC AGTCGTTGCC
CAGACCGGCT TCCCGGCGAT TATTCGACCA TCGTTTACCC TTGGTGGTGA AGGTGGCGGT
ATTGCGTACA ACATGGAAGA GTTCCGCGCG ATTGTTGAAC GCGGTCTCGA TGCGTCGCCG
GTCTCGCAGG TGCTAATCGA AGAGAGTGTC CTCGGTTGGA AAGAGTTTGA GCTTGAGGTG
ATGCGCGACC GGAACGACAA CGGCGTCATC ATCTGCTCAA TCGAGAATAT TGACCCGATG
GGGGTGCATA CCGGCGATAG CATTACCGTC GCCCCGGCGA TGACGCTCAC CGACCGCGAG
TACGAGCGGA TGCGCGATAT GGGCTTGGCC GTCCTTCGCG CCGTTGGTGT TGAAACCGGT
GGCTCAAACG TGCAGTTTGC CGTCTCGCCG ACTGATGGTC GCATCTACGT GATCGAGATG
AACCCACGGG TGTCGCGGTC GTCGGCGTTG GCCTCTAAAG CGACCGGCTT CCCCATCGCC
AAAATTGCCG CCAAGCTGGC CGTCGGCTAT ACCCTCGACG AGTTGCCCAA CGATATTACC
CGCGAGACGC CGGCCTCATT CGAGCCGACC CTCGACTATG TGGTGGTGAA GATTCCACGC
TTTACCTTCG AGAAGTTTCC CCAAGCTGAT CAGACGCTCA CCACCTCGAT GAAGTCGGTG
GGTGAAGTGA TGGCGATTGG ACGCACCTTC CCCGAAGCTT TCCAAAAGGC GTGGCGTTCG
CTTGAGCAGG GCCGGGCCGG TTGGGGGGCC GATGGCCACG ACGCGATCGA GCCGGAGCGG
CTGCGCGAGC GATTGATTAC GCCGCATCCC GACCGTATGT TTTATGTGCG CTACGCGCTG
CAAAGCGGGA TGTCAGTCGA GCAGATTAGC GCGCTGACCA AGATCGATCC GTGGTTTATT
CGCCAGCTCG AACAGCTCGT CAATCTCGAA GGGCGGTTAC GGGCATTTGA TCTCAACACC
ATTCCGCCCG ACCTGTTGCG CCAGGCCAAG CGGATGGGCT TCAGCGATGC CCAATTGGCC
CATCTCCTGC GCGTGCCACC CGGCCCGCAA CGCTGGTCGG CGGAATTGGC CGTGCGCAAG
CGTCGCCTCG AACTCGGTAT TCGCCCCACC TTCCACCGGG TTGATACCTG TGCCGCCGAG
TTCCCGGCTT TTACGCCCTA CCTCTACTCG TCGTATGAGA GCGAAGATGA GTCTGAGCCG
ACCGATCGTA AAAAGGTGGT GATCCTTGGT TCTGGTCCCA ACCGGATCGG GCAGGGTATT
GAGTTTGATT ACTGCTGTAG CCATGCCGTG TTCGGGTTGC GTGCGCTTGG CTATGAGACC
ATTATGGTCA ACTGCAACCC AGAGACGGTC TCGACCGACT ACGACACCGC CGACCGGCTC
TATTTCGAGC CGCTTACCCT TGAGGACGTG CTCAACGTCG TTGAAGAGGA GCGGCCCGAT
GGGGTGATTA TCCAGTTTGG CGGTCAGACG CCGCTCAAGT TGGCGCGGGC GCTCGAAGCG
GTTGGGGTGC CGATCTGGGG CACTATGCCT GAAGCCATCG ATCTGGCCGA GGATCGCGAC
CGATTTGGGG CGTTGTTGAA AGAGTTGAAC ATTCCCGCGC CTGAACATGG CAGTGCTACG
TCGTGGGAAG AGGCGCTGAG CGTGGCCCGG CGGATCGGCT ATCCGGTGGT GGTGCGCCCT
AGCTATGTGC TGGGTGGGCG GGCAATGGCG ATTGTTTACG ATGATGCGTC GCTGGAACGC
TATATGCGAG AAGCGGTTGC TGCGTCGCCC GAGCATCCAG TGCTGATTGA CCGCTTCCTC
GAAGATGCGT TTGAGATGGA TGTCGATGCG GTCTGTGATG GCGAGACGGT CGTGATCGCC
GGGATCATGG AGCAGATTGA GCTGGCTGGC GTTCACTCCG GCGATAGCGC CTGCGTCATC
CCGACCTATA TGGTGGCTGA AGAGCACGTC GCGACGATGC GCCGCTATAC CGAGCAATTG
GCTCGCGCGT TGGGTGTGGT CGGCCTGATG AATATCCAGT ACGCGATGAA AGACGGCGTC
GTCTACGTGC TTGAAGTCAA CCCGCGCGCC TCGCGCACCG TGCCTTTCGT TGCTAAGGCG
ACCGGTGTGC CGTGGGCGCA ATTGGCGGTG CAGTGCGCTG CCGGCGCTCG GCTGCGGTAC
GAGCATGGCC GCATTCAGAT GGTGCATCCG GCCTTGGTGA ATTCGCCTCG CTACCGACTG
GATGTCACCA GCGAGCAGCG CTACCACGTC AAGGAAGTGG TGCTGCCGTG GTCGCGCTTC
AGCGGGGTTG ATACGCTGCT TGGCCCAGAG ATGAAGTCAA CCGGCGAAGG CATGGGCAGC
GGCGCGACCT TTGGCGAGGC GTTCGCCAAG GCACAGATGG CGTGCAACAG CCACCTGCCC
ACCAGCGGCA ATGCCTTTCT CAGCGTCAAC GACCGCGACA AGGCCAATTT GTTGCCGATT
GCCCGTGATT TAGCTGCGCT CGGTTTCAAG CTGCTGGCCA CCAGCGGCAC GGCTGCCTTC
CTCCAGCAGC ACGGCCTTGA CGTGAAGCCA ATCTATAAGG TGAACGAAGG TCGGCCCAAC
GCGGTTGACT ACATCAAGAA CGGCGAGATC GCGCTGATCG TGAATACACC TCTCGGCAAG
GCCAGCTTCT TCGATGAAGG GGCCATTCGC CGGGCCGCGA TTGTGTATGG TGTGCCGACG
TTGACGACGC TGTCGGGGGC GGCGGCGGCA GTGCAGGCGA TCCAAGCGAT ACGTGCCGGA
CGGTGGACGT TGCAATCGTT GCAGGAACGG TCGGCGTTGG AAAAGGCGTA G
 
Protein sequence
MPKRTDLHTI LIIGSGPIVI GQACEFDYSG TQACKALREE GYRVVLVNSN PATIMTDPGL 
ADATYIEPLT VPSLERIIAR ERPDALLPTV GGQTALNLAV ALHEAGILDK YGVELIGASV
EAVRIAEDRQ RFKDKMIEIG LQVPRSGTAT TLDEALAVVA QTGFPAIIRP SFTLGGEGGG
IAYNMEEFRA IVERGLDASP VSQVLIEESV LGWKEFELEV MRDRNDNGVI ICSIENIDPM
GVHTGDSITV APAMTLTDRE YERMRDMGLA VLRAVGVETG GSNVQFAVSP TDGRIYVIEM
NPRVSRSSAL ASKATGFPIA KIAAKLAVGY TLDELPNDIT RETPASFEPT LDYVVVKIPR
FTFEKFPQAD QTLTTSMKSV GEVMAIGRTF PEAFQKAWRS LEQGRAGWGA DGHDAIEPER
LRERLITPHP DRMFYVRYAL QSGMSVEQIS ALTKIDPWFI RQLEQLVNLE GRLRAFDLNT
IPPDLLRQAK RMGFSDAQLA HLLRVPPGPQ RWSAELAVRK RRLELGIRPT FHRVDTCAAE
FPAFTPYLYS SYESEDESEP TDRKKVVILG SGPNRIGQGI EFDYCCSHAV FGLRALGYET
IMVNCNPETV STDYDTADRL YFEPLTLEDV LNVVEEERPD GVIIQFGGQT PLKLARALEA
VGVPIWGTMP EAIDLAEDRD RFGALLKELN IPAPEHGSAT SWEEALSVAR RIGYPVVVRP
SYVLGGRAMA IVYDDASLER YMREAVAASP EHPVLIDRFL EDAFEMDVDA VCDGETVVIA
GIMEQIELAG VHSGDSACVI PTYMVAEEHV ATMRRYTEQL ARALGVVGLM NIQYAMKDGV
VYVLEVNPRA SRTVPFVAKA TGVPWAQLAV QCAAGARLRY EHGRIQMVHP ALVNSPRYRL
DVTSEQRYHV KEVVLPWSRF SGVDTLLGPE MKSTGEGMGS GATFGEAFAK AQMACNSHLP
TSGNAFLSVN DRDKANLLPI ARDLAALGFK LLATSGTAAF LQQHGLDVKP IYKVNEGRPN
AVDYIKNGEI ALIVNTPLGK ASFFDEGAIR RAAIVYGVPT LTTLSGAAAA VQAIQAIRAG
RWTLQSLQER SALEKA
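
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the tool that generated this record: `clean` and `translate` are hypothetical helper names, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here).

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in this record.
    print(translate("ATGCCAAAGC GCACCGATCT CCATACCATT"))  # MPKRTDLHTI
```

Pasting the full gene sequence into `translate` in place of the 30-base fragment should reproduce the protein sequence listed above, once the block spacing is stripped from both.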