Gene Cagg_2082 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2082 
Symbol 
ID7267589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2546589 
End bp2548571 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content59% 
IMG OID643566917 
ProductNHL repeat containing protein 
Protein accessionYP_002463406 
Protein GI219848973 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTAC GTTTGCTGAT CGCCTTACTC CTTACGACAC TGATCAACGC TTGTGGGACA 
CCACCGCTTC CAACACCGGA ACCGGTCGCG CTTGGTCCGA CGGCAGTGAT ATTGAACGAA
GCGACCACGT TTAGCGAATT GAACGTGCGG TTACGCTTAC CTGCCGGCTG GCAGAGCCGC
ATCGAGAGTG GGATGTTGCG ACTCGCTCCC AACATGACAA CCCTCGAAGC CGATGTCATC
AATGAGCCGA TGATCCTCCT TGATACCACA TCACTTACCA CCTTGACCAC GCAATACGGT
TCGTCGGCCG CTAACCCGGA AACCATTTTC GAGCTGGCGA GTGGTGCGAT CCAATCGGCC
GGTTATACCA TAGCACCTAC CAAACCGATA CACCTTGGCA ACGCACATGG CGTAGTTGCC
GACATTACCG GACCGACCAG TACCGGTCGA TTACTCGTCC TCATTGACGA GACCCGTGCA
GTACGCATTT TGGTACAGGC GGCAAACGAC CAGTGGGTAC GATCACAAGC GCTGATCGAC
AGCATACTGG CAACCATCGA ACTGCTCCCC GTACCATCCC CTACACCTAC CCCGACCAAT
CTTGCCGCCC AACCACAAAT TGTGCGCTCT GGACCACCGG GCTTTGTGAT GCGGATCGGT
GGGCGGAGTG GCCCGGCCAA CAGCCGCTTC ATTGCCGCCC GCGGCTTAGC CGCCGCACCC
GATGGGACGA TCTACTTGGC CGAAAGCGGA CGTGGGGTCT GGGTCTTTGC CCCCGACGGG
ACATTACGCC AGACGTTCGG CGCCGATGAG CTACTCGACG CCTACGACGT AGCCCTCGGC
CCTACAGGCG ACATCTACGT CGCCGATTAT GGTCGTAACG CTATCGTCCG TTTTAGCAGC
GATGGCACCT TCCTCAGTCG ATGGGGCGGC CATGGCGACG CACCTGACCA ATTTGGGCTT
TCAGCACCCC AACGGATTGC AGTGGGGAAT GACGGCAGTG TCTACGCGCT CGATACTCGT
CCTGGTGCGG ATGGGCTAGC CGCAAGTAGT ATTGTGCGTT TCAGTGGTGA AGGGCGCTTC
CTTGAACGGA TCGAACTACC ACCCGATTTA GCGCCGGCCG ATTTAGTCGT CGACCCCGGT
GGTGTCATCT ATCTGGCCGA GAATTTTGCC GGCGTGATCG TTAAGCTTGC CCCCGACGGT
ACAGTTATCG CCCGCTTGGG CGATCCGGCC GATCCTACGC AATTCGCCGG ACCGGTACTC
GATCTTGATC GGGCCGGTTA TCTCTATCTT GCCACCTATA CCGGCATCAT CTTACGACTG
GCGCCCGACG GAACGATTGT CGCACGCGGG GGTAGTCCGG CTACCCCCGG CAGCCTGCCG
AACCCCGGAG AGATCAGTCT GCCCAACGGA ATTGTGGCTG CACCCGGTGG TGTTGTATGG
GTGAGCGACA ACAGTGGTGA GTACAGCGCA ATCTCGGCAT TTCGGCTCCA AACCGACGCC
GCGGCCCTAG CCACGGCAAT GGCACTCACG CCTACCGCCC TCACAGTGGT CGAAACAGCG
CAGCAGTGGG CGGTTGCGGC TACCGCCAGC AGCTTCTACG CTCCCGACTA CGATCCTGAC
GGCGTCATCG GCCCACCCAA CGTACCTGGC TGCCAAGACA GTCCTGACGC TTGGGCGCCG
GCCATCCCCG GCAGCCGTGA AACCCTCACC GTCACCTTTG CCGAGCCAAT GTTTGCCAGT
GCTCTGACCA TTTATCAAAA CCACCAACCC GGATACATCA CGCATGTCGA ACTTATTGAT
GAGCAGGGCA CTGTGCGAAC AGTCTACCGC GCCGACCCCA CCCCTGCGCC AGAGTGTCCG
TTTGTCACCA CGATCACCTT CGAGCAAACA CTCACACGTA TTGTTAAGGC GCAAATCACG
CTTAATCAGC GGGATGGCAG TTGGAGCGAG ATCGATGCGG TGGCCTTAAT CGGCATACCC
TAA
 
Protein sequence
MRLRLLIALL LTTLINACGT PPLPTPEPVA LGPTAVILNE ATTFSELNVR LRLPAGWQSR 
IESGMLRLAP NMTTLEADVI NEPMILLDTT SLTTLTTQYG SSAANPETIF ELASGAIQSA
GYTIAPTKPI HLGNAHGVVA DITGPTSTGR LLVLIDETRA VRILVQAAND QWVRSQALID
SILATIELLP VPSPTPTPTN LAAQPQIVRS GPPGFVMRIG GRSGPANSRF IAARGLAAAP
DGTIYLAESG RGVWVFAPDG TLRQTFGADE LLDAYDVALG PTGDIYVADY GRNAIVRFSS
DGTFLSRWGG HGDAPDQFGL SAPQRIAVGN DGSVYALDTR PGADGLAASS IVRFSGEGRF
LERIELPPDL APADLVVDPG GVIYLAENFA GVIVKLAPDG TVIARLGDPA DPTQFAGPVL
DLDRAGYLYL ATYTGIILRL APDGTIVARG GSPATPGSLP NPGEISLPNG IVAAPGGVVW
VSDNSGEYSA ISAFRLQTDA AALATAMALT PTALTVVETA QQWAVAATAS SFYAPDYDPD
GVIGPPNVPG CQDSPDAWAP AIPGSRETLT VTFAEPMFAS ALTIYQNHQP GYITHVELID
EQGTVRTVYR ADPTPAPECP FVTTITFEQT LTRIVKAQIT LNQRDGSWSE IDAVALIGIP