Gene Cfla_2501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2501 
Symbol 
ID9146405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp2803178 
End bp2804461 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content71% 
IMG OID 
ProductGlycine hydroxymethyltransferase 
Protein accessionYP_003637588 
Protein GI296130338 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACGTGCTCGA CCAGAACCTC TCCGAGCTCG ACCCGGAGAT CGCCGCGGTC 
CTCGACCGCG AGCTGGCCCG CCAGCAGCAC ACCCTCGAGA TGATCGCGTC CGAGAACTTC
GTGCCGCTCG CCGTCCTGCA GGCGCAGGGC TCGGTGCTGA CCAACAAGTA CGCCGAGGGC
TACCCGGGCC GCCGCTACTA CGGCGGCTGC GAGGAGGTCG ACGTCGCGGA GACCATCGCG
ATCGAGCGGG CCAAGGCCCT GTTCGGCGCG GAGTTCGCGA ACGTCCAGCC GCACTCCGGT
GCCACCGCGA ACGCGGCGGT GCTGCACGCC ATCGCGCGCC CCGGCGACAC GATCCTCGGC
CTGGCGCTGG ACCAGGGCGG CCACCTCACG CACGGCATGA AGATCAACTT CTCCGGTCGG
CTCTACGACA TCGTCGCCTA CGGCGTGGAC CCCGAGACGT CCCTCGTGGA CATGGCCGAG
GTCCGCCGCC TGGCCCTCGA GCACCGGCCC AAGGTCATCA TCGCCGGCTG GTCGGCGTAC
CCCCGTCAGC TCGACTTCGC GAAGTTCCGC GAGATCGCCG ACGAGGTCGG CGCGTACCTG
TGGGTCGACA TGGCGCACTT CGCCGGCCTC GTCGCCGCAG GCGTGCACCC GAGCCCCGTC
CCGCACGCGC ACGTGGTGTC GTCCACCGTG CACAAGACGA TCGGCGGCCC CCGCTCGGGC
TTCATCCTCA CCAACGACGC CGACCTCGCG AAGAAGATCA ACTCGGCGGT CTTCCCCGGC
CAGCAGGGCG GCCCGCTCAT GCACGTCATC GCCGCCAAGG CCACGGCGTT CAAGGTCGCC
GGCACCCCGG AGTTCCGCGA CCGCCAGGAG CGCACCCTGC GCGGTGCGCG GATCGTCGCC
GAGCGCCTGA GCCGGCAGGA CGCGAAGGAC GCGGGTGTCG CGGTGCGCTC CGGCGGCACC
GACGTGCACC TCGTGCTGGT CGACCTGCGC GAGTCGCCCC TGGACGGCAA GCAGGCCGAG
GACCTCCTGC ACTCCGCCGG CATCACGGTG AACCGCAACG CGGTGCCCAA CGACCCGCGC
CCGCCGATGA CCACGTCGGG CCTGCGCATC GGCACCCCGG CGCTCGCCAC TCGCGGCTTC
GGCGACGAGG AGTTCACCGA GGTCGCGGAC ATCATCGCCG AGGCGCTCAT CGGTGGTGTC
GACGCCGATG TCGAGGCGCT GCGCGCCCGG GTCAAGGTGC TCACCGAGCG CTTCCCGCTG
TACCCCGGCC TCCGGCAGTA CTGA
 
Protein sequence
MSDNVLDQNL SELDPEIAAV LDRELARQQH TLEMIASENF VPLAVLQAQG SVLTNKYAEG 
YPGRRYYGGC EEVDVAETIA IERAKALFGA EFANVQPHSG ATANAAVLHA IARPGDTILG
LALDQGGHLT HGMKINFSGR LYDIVAYGVD PETSLVDMAE VRRLALEHRP KVIIAGWSAY
PRQLDFAKFR EIADEVGAYL WVDMAHFAGL VAAGVHPSPV PHAHVVSSTV HKTIGGPRSG
FILTNDADLA KKINSAVFPG QQGGPLMHVI AAKATAFKVA GTPEFRDRQE RTLRGARIVA
ERLSRQDAKD AGVAVRSGGT DVHLVLVDLR ESPLDGKQAE DLLHSAGITV NRNAVPNDPR
PPMTTSGLRI GTPALATRGF GDEEFTEVAD IIAEALIGGV DADVEALRAR VKVLTERFPL
YPGLRQY