Gene Cfla_3663 details

Gene Information

Locus tag: Cfla_3663
Symbol:
ID: 9147579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cellulomonas flavigena DSM 20109
Kingdom: Bacteria
Replicon accession: NC_014151
Strand:
Start bp: 4047764
End bp: 4049140
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: mercuric reductase
Protein accession: YP_003638733
Protein GI: 296131483
COG category:
COG ID:
TIGRFAM ID:
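
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4047764
end_bp = 4049140
gene_length_bp = 1377
protein_length_aa = 458

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions both comparisons hold for this record.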


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 101
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGA ACACCTGGGA CCTGGTCGTG GTCGGCACCG GGGCAGCGGC GATGGCCGCC 
GGTATCGAAG CCCGCTCGCG CGGCAAGCGC GTCCTGCTCG TCGAGCACGG CCCGCTCGGC
GGGACGTGCC TGAACATCGG GTGCATCCCG AGCAAGAACC TGCTCGCCGC CGCCGGGCAG
CGCCACCGCG CCCTGGCCAA CCCCTTCCCG ATGGTCCCCA CCACCGCCGG CGAGGTCGAC
GTGCCCGCGC TGATGGGGCG CAAGCAGGAC CTGATCGACG GGCTGCGGCA AGCCAAGTAC
GAGGACGTCG CCGCCGCGCA CGGCTTCCCG ATCCGGCACG GGCACGCACG GTTCGTCGAC
GAGGCGACGC TGCACGTCGA GGACGAGCCG GTCCGGGCGG CGGCCTACGT CATCGCCACG
GGCGCCGCGC CGCACCTGCC GGACCTGCCC GGACTGCACG ACGTCGCCTA CCTCACCTCG
ACCACCGCGA TGGAGCAGCA GCAGCTGCCC GCGTCGATGG TCGTCATCGG CGGCGGCTAC
GTCGGGCTCG AGCAGGCGCA GCTGTGGTCC CACCTCGGAG TGCACGTGAC CCTGATCGGC
CGAGTCGCAC CGCACACCGA ACCCGAGGTC GCCGACGTGC TGCGCGCCGC GTTCCTCACC
GACGGCATCC AGCTCCTCGA GGAGCACGCC GTCGCCGTCG AGCGCGGGGC CGACGACACA
GTCCTCGTGC ACACCGCCAG CGGCCGAACG GCGAACGGCG AACGGCTGCT GGTCGCGACC
GGCCGGGCGG CCGACACCAC CGGCCTCGGC CTCGACGACG CCGGCGTCGC GACGGACGCC
CGCGGCTTCA TCGTCGTCGA CACCCACCAG CGCACCACCA ACCCGCGCAT CTACGCCGCC
GGCGACGTCA CCGGAGCACC TCAGTACGTC TACGTCGCCG CGCGGACCGG ACACGCGGCG
GCCGCTGGCG CCCTCGGCGA CCCCACCGCG GTCGACTACC GCGGCCTTCC CGGCGTCGTC
TTCACCACCC CGCAGCTCGC CTCGGCCGGA CTCACTGAAC AGCGCGCCCT CGAACTCGGG
CACACCTGCG ACTGCCGGGT CCTCACAGCT CAGGACATCC CCCGCGCCCT GGTCAACCAA
GACCCACGAG GCGTCCTGAA GCTCGTCACC GACGCCCACA CCCGCCAGAT CCTCGGCGTC
CACGCCGCAC TCGACGGCGC CGGGGAACTC ATGCTCGCCG CCACCTACGC CATCAAGTTC
GGCCTCACCA TCGACGACAT CGCCGACACC TGGGCGCCCT ACCTCACGAT GAGCGAAGCG
CTGCGCCTTG CCGCCGGACT CTTCCGCACC AACATCCCAA CCAGCTGCTG CGCCTAA
 
Protein sequence
MDENTWDLVV VGTGAAAMAA GIEARSRGKR VLLVEHGPLG GTCLNIGCIP SKNLLAAAGQ 
RHRALANPFP MVPTTAGEVD VPALMGRKQD LIDGLRQAKY EDVAAAHGFP IRHGHARFVD
EATLHVEDEP VRAAAYVIAT GAAPHLPDLP GLHDVAYLTS TTAMEQQQLP ASMVVIGGGY
VGLEQAQLWS HLGVHVTLIG RVAPHTEPEV ADVLRAAFLT DGIQLLEEHA VAVERGADDT
VLVHTASGRT ANGERLLVAT GRAADTTGLG LDDAGVATDA RGFIVVDTHQ RTTNPRIYAA
GDVTGAPQYV YVAARTGHAA AAGALGDPTA VDYRGLPGVV FTTPQLASAG LTEQRALELG
HTCDCRVLTA QDIPRALVNQ DPRGVLKLVT DAHTRQILGV HAALDGAGEL MLAATYAIKF
GLTIDDIADT WAPYLTMSEA LRLAAGLFRT NIPTSCCA
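
The sequences above are displayed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a rough coding-sequence check. The helper names (`clean_sequence`, `gc_content`, `looks_like_cds`) are hypothetical and not part of the record, and the check only tests for an ATG start, which is a simplification.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks of ten."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by three, ATG start, stop codon at the end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First row of the gene sequence above, used only as sample input.
first_row = "ATGGACGAGA ACACCTGGGA CCTGGTCGTG GTCGGCACCG GGGCAGCGGC GATGGCCGCC"
fragment = clean_sequence(first_row)
print(len(fragment), round(gc_content(fragment), 2))
```

To check the whole gene, paste every row of the gene sequence into one string and pass it through `clean_sequence` before calling `gc_content` or `looks_like_cds`.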