Gene Cagg_1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1920 
Symbol 
ID7268835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2353331 
End bp2354371 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content54% 
IMG OID643566757 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_002463251 
Protein GI219848818 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.892292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000398816 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTGACC GTGTGTTAAT TACCGGTGGT GCCGGATTTC TCGGCATTAA TTTAGCGCGT 
TATTTGTTGG CGCGCGGCTA TATCGTGCGT TCACTCGATA TTGCGCCTTT CGACTACCCT
GAGCGGAATC AAATTGAAGA GCATACCGGC GATATTCGTG ATCGGGCTGC CGTTGATCGG
GCAATGCAAG GGGTGAGGTT TGTTGTCCAT ACAGCGGCTG CACTTCCGCT CTATTCACCG
GCCGACATCT TCTCGACCGA TATTGATGGG ACACGCAACG TCCTCGAATC GGCCCGTGAT
CACGGCGTCG AGCGGGTAGT CCATATTTCG TCAACGGCAG TGTACGGTAT TCCCGACCAT
CACCCGCTGG TAGAAACCGA CCCGCTCAGT GGCGTGGGTC CGTATGGTGA GGCTAAAGTC
AAAGCTGAGG AGCTATGTCT CGAATTCCGC AAGGCCGGGA TGTGTGTACC GATCTTGCGA
CCCAAGTCGT TTGTCGGCCC TGAGCGACTC GGTATTTTTG CGATGCTGTA CGATTGGGCA
ATGGAAGGAC ACAACTTCCC GTTGCCCGGA AACGGCAAGA ATCGCTACCA GTTGCTCGAT
GTCGAAGACC TCTGTGAAGC AATCGTGCTC TGTCTGACGC TCGATCGCGA TCGGGTCAAT
GACACCTTCA ACATCGGCGC GAAAGAGTTT ACCACGATCA AAGAGGATTT TCAGGCGGTA
CTCGATGCAG CCGGCTATGG CAAGCGCATT ATCACCTTCC CGGCCAAGCC GATGGTGTGG
GCACTGGCGA TCCTCGAAAA ACTGAAGCTG TCGCCGGTCT ACAAGTGGGC GTATGGTACC
GTCACCGAAG ATTCGTTTGT GTCGGTCGAA AAGGCCGAGC GAGTGTTAGG CTTTACGCCC
AAGTATTCCA ACAAACAGGC ACTGGTCCGC AACTATCAGT GGTATGTTGC AAACGCCAAG
AAATTCGGTC AGCAGACCGG TGTCTCGCAC CGAGTGCCGT GGAGTCAAGG GATTTTGCGG
CTGGCGAAGC TATTCTTCTA A
 
Protein sequence
MADRVLITGG AGFLGINLAR YLLARGYIVR SLDIAPFDYP ERNQIEEHTG DIRDRAAVDR 
AMQGVRFVVH TAAALPLYSP ADIFSTDIDG TRNVLESARD HGVERVVHIS STAVYGIPDH
HPLVETDPLS GVGPYGEAKV KAEELCLEFR KAGMCVPILR PKSFVGPERL GIFAMLYDWA
MEGHNFPLPG NGKNRYQLLD VEDLCEAIVL CLTLDRDRVN DTFNIGAKEF TTIKEDFQAV
LDAAGYGKRI ITFPAKPMVW ALAILEKLKL SPVYKWAYGT VTEDSFVSVE KAERVLGFTP
KYSNKQALVR NYQWYVANAK KFGQQTGVSH RVPWSQGILR LAKLFF