Gene Cagg_2777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2777 
Symbol 
ID7269847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3412832 
End bp3414757 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content57% 
IMG OID643567598 
Productprotein of unknown function DUF839 
Protein accessionYP_002464076 
Protein GI219849643 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGTC GTGACGAAAA GGACAAATGG ATTGTCCGCT CGCAGGCTGG GATCGGTCAG 
ACGCTGGAAG AGGTGTTGGC GCTGCGGATT TCGCGCCGTG GCATGCTCAA GACGATGGCG
ATCGGTGGTT CGCTCGTCTT GATCGGTTCG AGTCTGTCGG CTGTCGAAGA GGCACTGGCC
GCGAATCCCG GTTTGAAGTT CAAGGTGGTG AAGCCTACCG ATCCGGACTT TGATGACGTG
GTTGTACCGG AGGGGTACTA TGCGCGTACC CTTATCCGTT GGGGCGAGCC GTTATATGCT
GATGCACCCG ATTTCGATGT CTGGTTACAG ACACCGGAAG CGCAGGTGAA ACAGTTTGGT
TACAACTGCG ATTTTGTTGG TTTCTTGTCG TTGCCCTACG GTTCCAATAA CTCGAACCGT
GGTCTGCTCG TGGTCAATCA CGAGTACACC AACGAAGAGT TGATGTTCCC GAAGTACGAT
GTTGAGAACC CGACACGTAA TCAGGTTGAT GTTGGGATTG CAGCCCACGG AATGTCGGTG
GTTGAAGTGG TGCGCGCGCC TGATGGTACG TGGAGCTATG TACGGAATTC ACGTTACAAC
CGGCGGGTAA GCGGGTTTAG TGCGACCCGC CTGAGCGGCC CTGCCGCCGA TCATCCGTGG
ATGCGCACCA GCACCGACCC CGGCGCCGAT GCGATTCTTG GTACGCTCAA CAACTGCGCC
GGTGGCAAGA CCCCGTGGGG TACGGTGGTG AGCGGCGAAG AGAACTTCCA CCAGTACTTC
GCCAACCTTA ACGGTCTCGA CCGCAACGAC GCTCGCTATG CGGTGCATCG CCGCTATGGT
ATGCCGACGG CTGAATCGGA GCGCAAGTGG GAGCGCTTCC ATAGCCGATT CGATATCTCG
AAAGAGCCGA ATGAGGGTTT CCGCTTCGGC TGGTGCGTGG AAATCGATCC GTATGACCCG
TTACGACCGG TGGTAAAGCG TACAGCGCTG GGACGCTTCC GCCACGAGGG AGCTACCTTT
GTGATTGCGC GCGATGGTCG TGTGGTTGGG TATCAGGGTG ATGATGCTCA GTTTGAGTAC
GTCTACAAGT TCGTCACCAA TGGCAAATTC AACCCGCGCA ATCGCGAAGC AAATATGAAT
TTGCTTGATG ATGGTGTCTT GTACGTAGCG AAGTTTAACG CTGATGGTAC CGGCGAGTGG
TTGCCGTTGG TGTACGGTCA AGGCCCGCTT ACACCGGAAA ATGGCTTCCG CTCGCAGGGT
GATGTGCTGA TCAACACCCG CTTGGCTGCC GATCTCGTCG GTGCAACCAA GATGGATCGA
CCGGAGGACG TAGAAGTCAA TCCGGTCAAC AAGAAGGTGT ACATTGCGTT GACCAACAAC
ACTCGTCGCG GCGCGAGTGG TCAACCGGGC GTTGATGCCG CTAACCCGCG TCCCAACAAT
GCGTGGGGAC ACATCATCGA GCTAACCGAG CGTAACGACG ATCACGCTGC AACCACGTTC
CGTTGGGAAA TCTTCATTCT TGCCGGTTTG CCGACACAAG AACATACCTA CTTCGTCGGC
TTCGACAAGA GCAAGGTCTC GCCGATTGGT GCTCCCGACA ATGTAGCCTT CGACAATCAG
GGCAATTTGT GGATTGCAAC CGACGGCGCA CCGCGTGCCA TCAAGTTCAA CGATGGTCTG
TTCGCTGTGC CGGTGAGCGG TTCACAGCGT GGCAACCTGC AACAGTTCTT CTCGTCGGTG
GCCGGGAGTG AAGTGTGCGG GCCAGAGTTT ACCCCCGACA ACCGCACGCT CTTCCTCGCT
ATCCAACATC CCGGCGAGGG CAGTACCTTC GAGAATCCAA GCAGCACGTG GCCCGACCGG
CAAGGTCTGC CGCGTCCAAG CGTGATCACG GTGCAGCGCT TCGATAGCGG TGTGATCGGG
ACGTAG
 
Protein sequence
MGGRDEKDKW IVRSQAGIGQ TLEEVLALRI SRRGMLKTMA IGGSLVLIGS SLSAVEEALA 
ANPGLKFKVV KPTDPDFDDV VVPEGYYART LIRWGEPLYA DAPDFDVWLQ TPEAQVKQFG
YNCDFVGFLS LPYGSNNSNR GLLVVNHEYT NEELMFPKYD VENPTRNQVD VGIAAHGMSV
VEVVRAPDGT WSYVRNSRYN RRVSGFSATR LSGPAADHPW MRTSTDPGAD AILGTLNNCA
GGKTPWGTVV SGEENFHQYF ANLNGLDRND ARYAVHRRYG MPTAESERKW ERFHSRFDIS
KEPNEGFRFG WCVEIDPYDP LRPVVKRTAL GRFRHEGATF VIARDGRVVG YQGDDAQFEY
VYKFVTNGKF NPRNREANMN LLDDGVLYVA KFNADGTGEW LPLVYGQGPL TPENGFRSQG
DVLINTRLAA DLVGATKMDR PEDVEVNPVN KKVYIALTNN TRRGASGQPG VDAANPRPNN
AWGHIIELTE RNDDHAATTF RWEIFILAGL PTQEHTYFVG FDKSKVSPIG APDNVAFDNQ
GNLWIATDGA PRAIKFNDGL FAVPVSGSQR GNLQQFFSSV AGSEVCGPEF TPDNRTLFLA
IQHPGEGSTF ENPSSTWPDR QGLPRPSVIT VQRFDSGVIG T