Gene Cagg_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1670 
Symbol 
ID7268972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2038013 
End bp2039899 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content54% 
IMG OID643566512 
Producthypothetical protein 
Protein accessionYP_002463007 
Protein GI219848574 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCAC GCCTTTACCT CATAGTTATA CTGGTACTCA CGTTGGTACC AATCAGCGCG 
GCTGCCCAAC AACCATCGCC AATCAGCATC ACCGTACAGG TAGGACTTGA CGGCGAGGGA
AGTTTTCGGC CCAAATACTG GGTACCGGTC TTCGTCACAC TTGCCAATGA CGGGCCAGAC
CAGCAGATCA AGCTCGAATG GCGCGACCAG AACACAGGGT CTTTCACCCA AAGCTATGTG
CTTGATCTAC CCGGCGGTGC GCGCAAGCAG ATCGTGTTAC CGGTTATCCA AACCTCTCGT
AGCGCAATCT TGACTGCGAC GGCAAACGGC GTACAGGTGT TTCGCGAACG GATTTTCCTC
AAGCAACTGC CTGATGATCA GATAGCAATC GGCTTGCTAA GTACCGATCC CACTGTGTTG
AGTAGCTTGA CGATTGCCGA TTTCGGAGCG ACTCGCGGCG CCACGATTAT CCCACTGACA
CCGGCGTTGC TGGTCGATAA CCCGCTGCTG CTGACGGCAA TTGATGTGAT CGCCGTGCGC
GAACTCACCG CCGAACTACG TCCAGAACAG CGGGAGGCTC TGATCACATG GGTGCAACAG
GGCGGCACCT TGCTGATCGG CGGTGGAGCA GTCGGCGAAA CGGCCATCCG CACCTTTGCC
GATATGCTCC CGGTCACCGT TGGGCCGCTC CAAGGGAATT GGCCGGTCAA CACTTTAGCA
CAGCTTATCG GTTTGAGTGG GTTAAGCAAC AGCGTCCCCC AACTTACGGC ACATACCGTC
ACGTTACGGG CAAATGCCCA TGCACTGACC AATGATACGC TGATTAGCCA GATGGAGCTA
GGAGCGGGAA AAATCATCTT CGCCGCGTTC GATCTTGCCA CATTGCGGGC CTGGCCGGGT
GAGGCCAAAC TGTGGGCGAA GGTTCTCGCG CTTCAACCCC GGATTGACAT CGGGGCAACG
TTCCGTTTTA GTTTTAACGA TCTGCTACAA AGTAGTTTGA ATCAACCACT GTTTGAGCTA
CCATCAACGA TGGTGATGCT CGGCCTTATC AGCTTGTACA TTATCGTGAT CGGGCCGCTT
CACTTCTTCA TTTTACGTCA ACTACGTCGG CTCGAATGGG CATGGCTGAC CACACCACTG
CTGATCGTTA TCTTTCTGCT CGGCACTTAT GGCATGAGCT TCGCCCTTCG TGGTACCCAA
ACGCAGATCG TTCAACTCAC CATTGTACAA ACCACGGCTA AAAGTGAGAC GGCCATCACA
ACAACGTTTG CCGGCATATT TTCCCCACAG CGGAGCCGTT ATACGCTGAC TGTCACCGAT
ACGGCCTTCG TTACCCCAAT GCGTACCGAT GTCGGGCCGG TTGAGACACA ACGCGACGAC
AACGCAGTGA CCATCCCCGA CCTTCAACTC GATGCGTCAG CATTTCAGAC ATGGATCGCC
GAGGAAGGAG GACCCAATCC GGTACAGATC GGTGCGCAAA TCACGCGCGA GGGCCAGGCT
TGGAATGGAA GTGTAACCAA TATCGGCGAA TTCCCACTCC GCGATGTCAT GGTGGTCTGG
CAGAACAATA TGCAATGGAT CGGTGACTTG CCAGCCGGCG CTGAGGCAAC GATCACCCTC
AACCCTAATC AAGGTAATTT CCTACGCGAA TTCATCCCCA ACGATCAGAA TAGTTTACTG
AACCACACGT TTGTGCTAGA GAACTTGTTT TGGTATAGTC AGACAACGAA CCGATTTACA
CCACCTAACG AACCACCTAG CATGCCCGAT ACCAGGATGT ACCTGATCGG CTGGAGTGAG
CAAGTGACGC CGGTATTCCA GATCGACGGT GTCGCGACCC GGACCCGTGG TGAGACGTTG
TATATCGTGG CCCTACAGCA ACCGTGA
 
Protein sequence
MRARLYLIVI LVLTLVPISA AAQQPSPISI TVQVGLDGEG SFRPKYWVPV FVTLANDGPD 
QQIKLEWRDQ NTGSFTQSYV LDLPGGARKQ IVLPVIQTSR SAILTATANG VQVFRERIFL
KQLPDDQIAI GLLSTDPTVL SSLTIADFGA TRGATIIPLT PALLVDNPLL LTAIDVIAVR
ELTAELRPEQ REALITWVQQ GGTLLIGGGA VGETAIRTFA DMLPVTVGPL QGNWPVNTLA
QLIGLSGLSN SVPQLTAHTV TLRANAHALT NDTLISQMEL GAGKIIFAAF DLATLRAWPG
EAKLWAKVLA LQPRIDIGAT FRFSFNDLLQ SSLNQPLFEL PSTMVMLGLI SLYIIVIGPL
HFFILRQLRR LEWAWLTTPL LIVIFLLGTY GMSFALRGTQ TQIVQLTIVQ TTAKSETAIT
TTFAGIFSPQ RSRYTLTVTD TAFVTPMRTD VGPVETQRDD NAVTIPDLQL DASAFQTWIA
EEGGPNPVQI GAQITREGQA WNGSVTNIGE FPLRDVMVVW QNNMQWIGDL PAGAEATITL
NPNQGNFLRE FIPNDQNSLL NHTFVLENLF WYSQTTNRFT PPNEPPSMPD TRMYLIGWSE
QVTPVFQIDG VATRTRGETL YIVALQQP