Gene Cfla_2947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_2947 
Symbol 
ID9146859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3269671 
End bp3270930 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content73% 
IMG OID 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_003638029 
Protein GI296130779 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCG TCCTGCCCCC CGGCCCCGGG GACTGGGACG ACCACCGCCT CGACCGCGCC 
GAGCTGCGCC CCGACACCCT CGCGGTGCGC GGCGGCCTCG TGCGCACGCC GTTCGGCGAG
ATGTCCGAGG CCCTGTTCCT CACGCAGGGG TACACGTACG CCACCGCCGC GCAGGCCGAG
GCCGGGTTCG CGGGCGAGGT CGACCGGTTC CTGTACTCGC GGTACGGCAA CCCGACCGTC
ACGACGTTCG AGGAGCGGTT GCGCCTGCTC GAGGGCGCCG AGGCCTGCTT CGCGACGGCG
ACCGGCATGT CCGCCGTCTT CACCGCCCTC GCCGCGCTCG TGCGGTCAGG GTCGCGCGTC
GTCGCCGCGC GGGCGCTGTT CGGCTCGACC GTCGTGATCC TCGACGAGAT CCTCGCGTCG
TGGGGCGTGC GTACCGACTA CGTCGACGGC CACGTGCCCG AGCAGTGGGA GCAGGCGCTC
GCGACGCCCG CCGACGTGGT CTTCTTCGAG ACCCCGTCCA ACCCGATGCA GGACCTCGTC
GACATCGCGG CGGTCAGCCG GCTCGCGCAC GCCGCCGGGG CCACGGTCGT CGTCGACAAC
GTCTTCGCCA CGCCCGTCTT CTCCCGCCCG CTCGACCACG GCGCCGACGT CGTCGTGTAC
TCCGCGACCA AGCACATCGA CGGTCAGGGG CGCGTGCTGG GCGGCGCGAT CCTCGGCTCC
GCGGAGTACG TGCGCGGCCC TGTGCAGACG CTCCTCCGCC ACACCGGCCC GTCGCTGTCA
CCGTTCAACG CGTGGGTGCT GCTCAAGGGA CTGGAGACGC TGTCCTTGCG CGTGCGGCAC
CAGGCCGGCT CGGCGCTGGA GCTCGCACGC TGGCTCGAGG AGCAGCCGGG AGTCGCACGC
GTGCGCTACC CGTTCCTCGC GTCGCACCCG CAGCACGACC TGGCGCGCGC GCAGCAGACG
GGCGGCGGCA CGGTCGTGAC GTTCGACCTC GACGTGCCCG CCGACGCGAC GCCCGACGTG
GCGAAGAAGG CGACGTTCGG CGTGCTGGAC GCGTTGCGGG TGGTCGACAT CTCCAACAAC
CTCGGGGACA CCAAGTCGAT CGTCACGCAC CCCGCGACGA CGACGCACCG CCGGCTCGGC
CCGGCGGGAC GTGCCGCGGT CGGCATCGCC GAGACGACGG TGCGCCTGTC GGTCGGGCTG
GAGGACGTCG AGGACCTGCG CGACGACCTC GCGCAGGCGC TCGGCACGCT GGGCGGCTGA
 
Protein sequence
MSTVLPPGPG DWDDHRLDRA ELRPDTLAVR GGLVRTPFGE MSEALFLTQG YTYATAAQAE 
AGFAGEVDRF LYSRYGNPTV TTFEERLRLL EGAEACFATA TGMSAVFTAL AALVRSGSRV
VAARALFGST VVILDEILAS WGVRTDYVDG HVPEQWEQAL ATPADVVFFE TPSNPMQDLV
DIAAVSRLAH AAGATVVVDN VFATPVFSRP LDHGADVVVY SATKHIDGQG RVLGGAILGS
AEYVRGPVQT LLRHTGPSLS PFNAWVLLKG LETLSLRVRH QAGSALELAR WLEEQPGVAR
VRYPFLASHP QHDLARAQQT GGGTVVTFDL DVPADATPDV AKKATFGVLD ALRVVDISNN
LGDTKSIVTH PATTTHRRLG PAGRAAVGIA ETTVRLSVGL EDVEDLRDDL AQALGTLGG