Gene Cagg_2114 details


Gene Information

Locus tag: Cagg_2114
Symbol:
ID: 7267621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2600385
End bp: 2602349
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 57%
IMG OID: 643566947
Product: hypothetical protein
Protein accession: YP_002463436
Protein GI: 219849003
COG category:
COG ID:
TIGRFAM ID:
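
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon, neither of which the record states explicitly.

```python
def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(2600385, 2602349)
print(gene_length)                           # 1965
print(expected_protein_length(gene_length))  # 654
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.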


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.325526
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTCA TCATCTTGTT GAGCGTCATC ATGTCCCTGC TTGCTTGGGC ATCACCGGTA 
ACTGCTCAAT CAGCCACGCC AATCTGTTTT CCCGATGTGC CGTTGATCGA TAATTGTCTG
CACCCATCGT TCGCTACCTA CTGGCGTGAC AATGGTGGTT TGCCGGTGTT TGGTTACCCG
ATCAGTCCGC TGGAACAGTT TGTTGGCGAT GAAGGCCGTA CCTTAACCGT CCAATGGACA
GAACGAAACC GCTTGGAATT GCATCCGGCC AACCCGCCGG CCTATCGCAT CCAGATCGGG
CGCATGGGAG CCGAGGCGTT GGCGCGGGCC GGTCGTGATC TGTTTGCCGA CCCACCCGAC
CCCGGCCCGC AACCCGGTTG CTTGTGGTTT CCAGAGACCG GTCATAATGT GTGTGATCAA
GAGCCGGGCA ATGGGTTTCG CACCTATTGG CAGAATAATG GTCTGCGCAT TCCCGGTCTT
AGCCGTTATG CCCAATCACT GGCGTTGTTT GGCTATCCCC TCACCGCGCC GCAGATGGAA
CGGAATGCAA ATGGCGATCT GGTGTTGACC CAGTGGTTCG AGCGGGCCCG TTTCGAATGG
CATCCACAAA ATCCGGCTCA TTCGCGGGTG TTACTGGGGC TGTTGGGGCG TGAGTTATAT
ACCTCACCAC AGCCATTGCC CGATCTGCGG GCCGGGCGGT CGGTGTTCGG CGTCGAGATC
AATCGCGGTA TGGTCGCCGC TACCGCCACG CAGTTGGCCG AACTGGGAGC TGATTGGGTG
CGCTACAACG GCATTCGCTG GGATGAGGTC GAGCCGAACC GTGGGGTGCG TAATTGGGCA
GCGTTGCAGG AGGTTGAGAC CGAGTTGCGC CTGATTAGCG CAAGCGGTGC TGTACCGATG
GTCATTGTGC GTGGTACACC GACGTGGGCG CAGGCCCAGT CTCGTTCAGC ATGTGGTTCG
ATCCGGTCAG ATGCGCTGCC TGAGTTCGCC GTTTTTCTTA CCGAACTTGT GACACGCTAC
AGCCGGCCAC CGTACAATGT CAAGTTTTGG GAATTGGGCA ATGAACCTGA TGCCCCGTTT
CAGCTTGTTG GTAGTGATGC GCCGTTTGGT TGTTGGGGCG ATGAAAGCGA TCCCGATAAT
TACGGTGGTG CTGCCTACGC CGCAATGCTT AAAGTGGCCT ATCCGGCGAT CAAAGCTGCC
GATCCGCAGG CACAGGTGAT TTTTGGTGGT TTACTGCTCG CATGCTCACC GGATCATGCC
GTTCGCAATA ATGAACCATG TAATGCTGGA CGCTTCTTTG AAGGGGTGCT GCGCGCCGGG
GGTGGTGATT ACTTCGATAT TCTGGCGTAT CACGCTTACG CGTACTTTGC CGTTCATCGC
GATCCTGACC GTGAGCATCC GGCGTGGGCT GATGGAGGTG GTGCAACGCT GGGTAAACTT
GTTTACCTGC GGTCGGTGCT TGCTCATTAT GGCTACGCTA AGCCGGTGAT CATGAACGAA
GGGGCGTTGC TCTGTTACCG TAGCTCGCCG AATTGCCGGC CCAACGGTTT CGAGGAGGCA
AAGGCCGATG CAGTAGCGCG GCTGTATGCC CGCACGTTGG CTGCCGATCT GCTGATGTCG
CATTGGTATA CGCTCAACGG GCCCGGCTGG CAAGAGGGTG GCTTGCTCGA TGATCGGCAG
CAGCCGCAAC CGGCGTTTCG GGCTTTCCGG TTTTTGCGTC AGCAACTTGG TGAGGCACGG
TATGTGCGTG CAGTGGATGA TACCGGTTTG GAGGGTTATC GTTTTCGCAC GCCGAGCGGT
GATGTGATCG TGGTGTGGAG CAGCGATGGC AAAGAACGAA CGTTTGTGTT GCCGGCGCCA
CCGCGAGCTA TCTACGATGC AACGGGACAA GAACTGCCGA TCACCGATAT CTTGACCCTC
ACCATTGCAC CGCGGATTAT TCGACTTGCC GAGACGTCAC CATAA
 
Protein sequence
MQLIILLSVI MSLLAWASPV TAQSATPICF PDVPLIDNCL HPSFATYWRD NGGLPVFGYP 
ISPLEQFVGD EGRTLTVQWT ERNRLELHPA NPPAYRIQIG RMGAEALARA GRDLFADPPD
PGPQPGCLWF PETGHNVCDQ EPGNGFRTYW QNNGLRIPGL SRYAQSLALF GYPLTAPQME
RNANGDLVLT QWFERARFEW HPQNPAHSRV LLGLLGRELY TSPQPLPDLR AGRSVFGVEI
NRGMVAATAT QLAELGADWV RYNGIRWDEV EPNRGVRNWA ALQEVETELR LISASGAVPM
VIVRGTPTWA QAQSRSACGS IRSDALPEFA VFLTELVTRY SRPPYNVKFW ELGNEPDAPF
QLVGSDAPFG CWGDESDPDN YGGAAYAAML KVAYPAIKAA DPQAQVIFGG LLLACSPDHA
VRNNEPCNAG RFFEGVLRAG GGDYFDILAY HAYAYFAVHR DPDREHPAWA DGGGATLGKL
VYLRSVLAHY GYAKPVIMNE GALLCYRSSP NCRPNGFEEA KADAVARLYA RTLAADLLMS
HWYTLNGPGW QEGGLLDDRQ QPQPAFRAFR FLRQQLGEAR YVRAVDDTGL EGYRFRTPSG
DVIVVWSSDG KERTFVLPAP PRAIYDATGQ ELPITDILTL TIAPRIIRLA ETSP
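
The relationship between the gene and protein sequences above can be spot-checked by translating part of the gene. The following is a minimal sketch, not the method used to produce this record: `translate` is a hypothetical helper, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in which codons may act as starts).

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments of the standard
# code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring whitespace, up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt and last 15 nt of the gene sequence as listed in the record.
first_60 = "ATGCAACTCA TCATCTTGTT GAGCGTCATC ATGTCCCTGC TTGCTTGGGC ATCACCGGTA"
last_15 = "GAGACGTCAC CATAA"

print(translate(first_60))  # MQLIILLSVIMSLLAWASPV
print(translate(last_15))   # ETSP
```

The two outputs match the first 20 and last 4 residues of the protein sequence, and the gene ends in a TAA stop codon.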