Gene Cagg_3683 details


Gene Information

Locus tag: Cagg_3683
Symbol:
ID: 7268218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4475838
End bp: 4477166
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 57%
IMG OID: 643568489
Product: NusA antitermination factor
Protein accession: YP_002464955
Protein GI: 219850522
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
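
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical. The strand is not given in the record, so only lengths are checked.

```python
# Values copied from the record above.
start_bp = 4475838
end_bp = 4477166
gene_length_bp = 1329
protein_length_aa = 442

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon that is not
# counted in the protein length.
codon_count = gene_length_bp // 3

coords_match = span_bp == gene_length_bp
protein_match = gene_length_bp % 3 == 0 and codon_count - 1 == protein_length_aa

print(f"span: {span_bp} bp, codons: {codon_count}")
print(f"coordinates consistent: {coords_match}")
print(f"protein length consistent: {protein_match}")
```

Under these assumptions, both checks agree with the listed gene length and protein length.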


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00388019
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGCG ATTTTTACGC GGCAATTACT CAGATTGCGT CTGAACGCGG TATTCCAAAA 
GAGGCCATCA TTGACGTGAT GGAACGGGCA TTGGTGAGCG CATACAAGCG CTTACTCGGC
CCCAATCCAC CGGCGGTAGA AGTGACGGTG AAGCTCGATC CCGTCAGCGG TATGGCGCGG
GTCTACGCCG AGAAGCAAGT GGTCGATGAA GTGTACGACG AGCGGTTTGA GATCGACCTC
AACAGTGCGC GCCAAATCAA GCCCGACGTT CAGATCGGCG AGACGGTGCT GGTCGAGAGC
ACACCGCGTG ATTTTGGCCG GATCGCCGCC CAAACTGCGA AGCAAGTGGT GTTGCAAGGG
ATTAAAGAGA TCGAACGGAG CTATATCTAT AGCGAATTTG AAGATCGCGA AGGCGAGTTG
GTGACGGCGA CGGTACAGCG CAACAACGGG CCACGCGGCA ACGTGATCCT TGAGATAGGG
AAGGCCGAGG CCATTATCCC GCCGAAAGAG CAAGTGGCAA ACGATCGCTA TTACCACGGC
CAGCGGCTCA AGGTATTGCT GCTCGAGGTC AAGAAGGATG ATCGCGGCCC TCGCCTGATT
GCGTCGCGCG CGCACAAGAA TCTGATCAAC CGCCTGTTTG AGATGGAAGT TCCCGAAATC
TACAACGGGT CGGTTGAGAT TAAGTCGGTG GCCCGTGAAC CCGGTCTGCG CACGAAAGTC
GCTGTTGCGG CCCGCCAAGA GGGGATCGAT CCGGTCGGGT CATGCGTCGG CATGCGTGGT
ATCCGCATTC AGAATATTGT TAATGAGCTA AACGGCGAAA AGATCGATGT GGTGCAGTGG
TCGTCGGATC CACGTGAGTA TATCGCCAAC GCCCTCTCGC CGGCACAAGT GGTTGAGGTG
CATCTCCACG ATCACGACCA TACCGCGCTG GTGATTGTCC CCGACAAACA ACTTTCGCTG
GCAATCGGTA AGGAAGGGCA GAACGTCCGG CTGGCAGCGA AGCTCACCGG TTGGCGGATC
GATATTAAGA GTGCGTCGGC GTTGCTCGAA GAGGAACGGG CCGCTGCTGA GGCCCGCGCT
GAAGCTGAGG CCGAACAGAT GGTGATTGCA GCCGAATTGG CGCATGCCCC GGTTGAACAG
CGCATCGTAC AGGCTGACGG CACCATCCGC TACCGCGGTC ATCGCTACGG CCCACTCGGT
AACGATTTGG TTGGACAGAC GGTGCAGGTA CGGGCAACAT CGCAAAAGAT CTTTATTTAC
TATAATGACC GCCTGTTCGC CTCCTATATC TTGCTGGAAG GTGGACAGGC TTCGCCTGAT
GAGGAGTAA
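
The gene sequence above is displayed in space-separated blocks of ten bases, so it needs to be joined before any length or GC calculation. The following is a minimal sketch of that clean-up, using only the first displayed row as sample input. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence, copied as shown
# (including the grouping spaces).
first_row = "ATGAAAAGCG ATTTTTACGC GGCAATTACT CAGATTGCGT CTGAACGCGG TATTCCAAAA "

seq = clean_sequence(first_row)
print(len(seq), "bases in the first row")
print(f"GC fraction of the first row: {gc_fraction(seq):.3f}")
```

Applying the same two helpers to all rows of the gene sequence would allow a comparison against the listed gene length and GC content.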
 
Protein sequence
MKSDFYAAIT QIASERGIPK EAIIDVMERA LVSAYKRLLG PNPPAVEVTV KLDPVSGMAR 
VYAEKQVVDE VYDERFEIDL NSARQIKPDV QIGETVLVES TPRDFGRIAA QTAKQVVLQG
IKEIERSYIY SEFEDREGEL VTATVQRNNG PRGNVILEIG KAEAIIPPKE QVANDRYYHG
QRLKVLLLEV KKDDRGPRLI ASRAHKNLIN RLFEMEVPEI YNGSVEIKSV AREPGLRTKV
AVAARQEGID PVGSCVGMRG IRIQNIVNEL NGEKIDVVQW SSDPREYIAN ALSPAQVVEV
HLHDHDHTAL VIVPDKQLSL AIGKEGQNVR LAAKLTGWRI DIKSASALLE EERAAAEARA
EAEAEQMVIA AELAHAPVEQ RIVQADGTIR YRGHRYGPLG NDLVGQTVQV RATSQKIFIY
YNDRLFASYI LLEGGQASPD EE