Gene Cagg_1978 details

Gene Information

Locus tag: Cagg_1978
Symbol:
ID: 7268894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2416214
End bp: 2417629
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 56%
IMG OID: 643566813
Product: hypothetical protein
Protein accession: YP_002463306
Protein GI: 219848873
COG category:
COG ID:
TIGRFAM ID:
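
The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record states neither, but both assumptions reproduce the listed values. The function names are illustrative only.

```python
def coding_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
print(coding_span(2416214, 2417629))  # 1416
print(protein_length(1416))           # 471
```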


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.014731
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCGTA TTGAGGCAAT GAAGCCGGCA TTTTCAGCCT TCGCCCTTCA GTGGCGGCAT 
TTGTTCGCCC AATATGTTGC TGAATGGCGA ATTATCCGCT GGCTCGCACC GATCGTTCTG
ATACACTCGC TCGTGTACGT CTTTCTCGTA CCGCCGTGGC AGCACTACGA CGAACCGGGC
CACTTTCTCT ACGCAGCCTA CATTGTTCGT GGAGGGATCG CAGCACCAGA CAATGTCGCG
ATTGCCCGTG AAGTGGCCGA TTCGATGTAT CGCCATCATT TTTGGCCGCC GGATGTGCGA
CCCGACCTGC TCAGCCCACG GCCACCAGCT ATCCCGACCG ATCAACGTCA CCATCCACCA
CTTTATTATG TACTTATGGC CGGTATCCTC GGCCCGCTGC GCTATCTGCC GGTTGAACTG
CAACTATATG CCGGACGACT CGTCAGTGCG GGGTTGATGA TGTTGACCGT CCTCGCCGTT
TGGCGCACGG TACGGATTAT GGCTCCCGAC GAGCCACATA TGGCAATCGT ACTGGCAGCA
CTGGTAGCTA TGACACCGGC GTTCGTCGAT TTGATGAGTG CATTCAACAG TGATGTGCTC
ATGAATTGGG CCGCGGCAGT CGCTTTTTTA GGCTTTGCCC TTCTCTTGCG CAACGGTTGG
CAGCCAACCG GTATTACCTT GGCTGTGCTC GGTACTCTCG TCGCGATTCT GACGAAACGT
ACCGCGGTAC CGTTAATCGG CCCACTGGTA GTAGTGCTGG TATGGACGGC ATATCGGCGT
CCAATTCGGT GGTGGTGGTA TGGCTTGAGC GGCCTTGCGA TCATCACGTT AGCGGTATTG
AGCAGTTTTT CATTCACCGG CGGTGAGCTG CGGGTACAAC CGTGGCTCGC CACCCTTGAA
CGCGACTATT TGCGCGTACC GATCATTCCG TGGCTAGAAT CGTGGCTCAA CTGGGAGCGC
GCGTGGCCGT GGTACCTGCG CACGCTCGAG GTAGCGCATA GCCACTTTTG GATGCGGTTG
GCATGGGGAC ACGTTGCCGT TTTACCTCCG GTGGGCGACT GGCTCGTGGT CGGTGTAAGT
ATTGCCGCAA TCCTCGGCCT GTTCCGCGGT ATTCGCAGTT GGTCAACAAC GCTCACCCTT
GATCAACAAC GTTGGATCTG GCTTTGTCTG CTGGCAGTGG GTATCGCATG GCTCGCGTTG
TTCGGTCGTC TCCATCCGCT ACCAGAAACC GGCAACACCT ACATCCCGCG TGGTCGTTAC
TTGTATTGGG CACTGTTGCC GACGATGTGG CTCCTTCTGG TCGGTTGGCA ACATCTCTGG
CCCGAGCGAT GGCGGCCACT GACGTGCTAC ATACTCATCG GACTATTTGC CGCATTTGAT
ATGATTGCAA TAGTGACGAT TGTCCGTCAG TTGTAG
 
Protein sequence
MARIEAMKPA FSAFALQWRH LFAQYVAEWR IIRWLAPIVL IHSLVYVFLV PPWQHYDEPG 
HFLYAAYIVR GGIAAPDNVA IAREVADSMY RHHFWPPDVR PDLLSPRPPA IPTDQRHHPP
LYYVLMAGIL GPLRYLPVEL QLYAGRLVSA GLMMLTVLAV WRTVRIMAPD EPHMAIVLAA
LVAMTPAFVD LMSAFNSDVL MNWAAAVAFL GFALLLRNGW QPTGITLAVL GTLVAILTKR
TAVPLIGPLV VVLVWTAYRR PIRWWWYGLS GLAIITLAVL SSFSFTGGEL RVQPWLATLE
RDYLRVPIIP WLESWLNWER AWPWYLRTLE VAHSHFWMRL AWGHVAVLPP VGDWLVVGVS
IAAILGLFRG IRSWSTTLTL DQQRWIWLCL LAVGIAWLAL FGRLHPLPET GNTYIPRGRY
LYWALLPTMW LLLVGWQHLW PERWRPLTCY ILIGLFAAFD MIAIVTIVRQ L
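
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. The helper names are hypothetical. The codon table is written out for the standard genetic code; translation table 11 uses the same amino-acid assignments and differs only in which codons may act as start codons.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGGCGCGTA TTGAGGCAAT GAAGCCGGCA")
print(translate(fragment))  # MARIEAMKPA
```

Passing the full gene sequence through `clean_sequence` and `translate` and comparing the result with the cleaned protein sequence gives a quick consistency check on the record; `gc_content` on the same cleaned sequence can be compared with the listed GC content.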