Gene Cagg_1403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1403 
Symbol 
ID7267255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1728620 
End bp1729636 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content59% 
IMG OID643566246 
Productprotein of unknown function UPF0052 and CofD 
Protein accessionYP_002462746 
Protein GI219848313 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0067064 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATC TCCCCTCACT ACGACTGGTT GCCCTCGGTG GCGGTGGCGG TAGTGCCCAG 
ACCCTTCTCG GTGCTCGCCC GTTCTTCGCC GAACGTACTG CGATCATTGC CGTAACCGAT
TCAGGTCGCA GTACGGGTAT AGCACGTCAG ATCGCCGATA TTCCGGCCCC CGGTGATCTG
CGCAACGTAC TGGCTACATT GGCGGCTGCA CCAAACCAAA CGCTCGCCCG CTTAATGCAA
TACCGACTTA AAAGCCCGTC CTTTCCCTTC CTGGATGGAA TGGCAATAGG GAATCTCATG
TTGGGCGGAC TCGTACAGAT GGAAGGAGAC ATCGCCGCCG CCGCAGCGAC TGCGCGCGAG
CTGTTGGGTT GCCCAGAACA CATTTTGCCC GTTTCGACGG CCAATACCCG TTTGTGTGCC
GAGCTGGCCG ACGGACGCAT TGTCGTCGAG GAGGTTGCCG TGCGGACACC TGGCAAGCCA
CCGATTCGCC GCCTCTTTCT CGACCCACCG GCAGCCGCTT ACCCCCCTGC GCTTACCGCA
TTGGCAACCG CCGATCTGAT CGTGATCGGC CCCGGTAGCT TCTATACATC ACTTCTGGCA
ACACTCTGCT TTGATGGTAT CGTCGAAACC CTGCGCACCA CACCGGCAAC CATCGTATTT
GTTTGCAACA CAACAACTCA GCCCGGTCAG ACCGATGGTA TGACAATAGC CGACCACGTG
GCCCGCCTTG TCGAAGTTCT CGGCCCTGGT GTGCTTGACG TGGCGTTGAT TAACGATGCG
AACGCTCTCC ATCCGGCAAT TGTCGCTCGC TATCAAGCTG CGGGCCTGCA TCCACTCTCG
CTGACCGATA CCGACCGACA GGCGATTCGT GCCCTCGGCG TCGAGCCGCT CGTGCGCGAT
TTAGCCGAAC CTGATCCCGG CCACCGTGAA CTGTGGCAAA AAGCCGATAC CATCCGTCAC
GATCCACAAA CGCTTGGCTT GGCCCTGTGG AAGCTGGCGC TTGATCGGAT GCAATAA
 
Protein sequence
MTNLPSLRLV ALGGGGGSAQ TLLGARPFFA ERTAIIAVTD SGRSTGIARQ IADIPAPGDL 
RNVLATLAAA PNQTLARLMQ YRLKSPSFPF LDGMAIGNLM LGGLVQMEGD IAAAAATARE
LLGCPEHILP VSTANTRLCA ELADGRIVVE EVAVRTPGKP PIRRLFLDPP AAAYPPALTA
LATADLIVIG PGSFYTSLLA TLCFDGIVET LRTTPATIVF VCNTTTQPGQ TDGMTIADHV
ARLVEVLGPG VLDVALINDA NALHPAIVAR YQAAGLHPLS LTDTDRQAIR ALGVEPLVRD
LAEPDPGHRE LWQKADTIRH DPQTLGLALW KLALDRMQ