Gene Cagg_0080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0080 
Symbol 
ID7266818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp112820 
End bp114058 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content59% 
IMG OID643564953 
Producthypothetical protein 
Protein accessionYP_002461469 
Protein GI219847036 
COG category[S] Function unknown 
COG ID[COG5368] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.815437 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGAAA CAACCTTATG CGAATTTGAA ACTGAAGGGG CACTGGCTTT TTTCCGGGCC 
GGCACTAATC TCGATCCGGC CAGCCCCGGT TATGGCCTGA CAGTTGATGC CATCCATCGT
CCACGCATCG CTTCGATTGC TTCGGTTGGG TTTGCCTTAA GCGCATGGGT CATCGCAGTC
GAACGCGGAC GGATGAGCCG CGCCGAGGGG CTGGCGATTA CCCGCGGCAC GCTCCGCACG
CTCTATGAGC GCGTGCCGCA CCAATACGGC TTCTTTGCCC ACTTTCTCGA TCGCTATACC
GCAGCCCGTT GGCAACACTG CGAATACTCG ACGATCGATA CCGGCCTTTG CCTGAACGGA
GTGATTACCG CAGCCGCCTA TTTTCGCGAC GCCGAGATTG ATGAGCTAGC AATGCGCCTG
CTCGACCGGA TCGATTGGAA GGCGTTTATC ACCGAACGCG CCGGCCATAC CGTCTTGCGG
ATGTCGTATA ACTCTGACGC CGATGGCGAT TATGTAACCG ATACACCGGG GTTCATTAGC
TATTGGGATA TGGCCGCCGA GCAGAAGCTG CTTTACATCC AAGCGGCACT CTGTGTACCG
GCAAACACCG CTCGCGCGCT CTACCGCGGC TTTCGCCGTG ACATCGGCGT CTACCAAGGT
CAGCCGGTGA TTATTAACCC CGGCGGCACG CTCTTCGCGT ACCAGTACAC CGAAGCGTGG
CTAGACACCC GCAGCTATCG CGATCCCGAT GGCGTTGATT GGTTCAACAA CGCGCGGCTA
GCGGCGCTGG TCAACCGCGA CTTTTGCCTC AGCTTGCGCG ATCAGTTTCG CACCTATCAC
GAGCGGAGCT GGGGCATTGG CTGCGGCGAC ACCCCACGCG GCTACGTCGT TGCCGGCGCA
CCGCCGGCGC TGGCACCGAT CGAACCCGAC GGCACGGTCT CGATTAGCAA TGCCACTGCC
TGTATGCCAT TCATACCTGC TGAAGTACCG GCGATGCTTG ACTACCTCTA CCACGAGCAG
CCGCAAACTT GCGGGCCATA CGGCTTTTAC GATGCCTATA ACCTCGCGGT GAAGCCGCCG
TGGTATAGCC GGACGATTTA CGGGATTAAC AAAGGGTGTG CATTGCTGAT GCTCGAAAAT
GCGCGCAGCG GCCTGATTTG GGATGTCTAT TCGTCGAGTT TGCCGATCCA ACGGGCGCTG
AACGTGTTGG GGTTCACCAA GCACGAGAAA GCGCACTGA
 
Protein sequence
MDETTLCEFE TEGALAFFRA GTNLDPASPG YGLTVDAIHR PRIASIASVG FALSAWVIAV 
ERGRMSRAEG LAITRGTLRT LYERVPHQYG FFAHFLDRYT AARWQHCEYS TIDTGLCLNG
VITAAAYFRD AEIDELAMRL LDRIDWKAFI TERAGHTVLR MSYNSDADGD YVTDTPGFIS
YWDMAAEQKL LYIQAALCVP ANTARALYRG FRRDIGVYQG QPVIINPGGT LFAYQYTEAW
LDTRSYRDPD GVDWFNNARL AALVNRDFCL SLRDQFRTYH ERSWGIGCGD TPRGYVVAGA
PPALAPIEPD GTVSISNATA CMPFIPAEVP AMLDYLYHEQ PQTCGPYGFY DAYNLAVKPP
WYSRTIYGIN KGCALLMLEN ARSGLIWDVY SSSLPIQRAL NVLGFTKHEK AH