Gene Cfla_3421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_3421 
Symbol 
ID9147337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp3803799 
End bp3805037 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content74% 
IMG OID 
ProductHipA N-terminal domain protein 
Protein accessionYP_003638497 
Protein GI296131247 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCC GGACGCTGGA TGCCTGGCTC TACGGGACGC TCGTCGCGCA CATCGAGCGC 
GACCGTGACG ACCGCGTCCG CCTGCGCTTC ACCGACGACG CCCTCGACCG CTGGGGGCAC
GGGTCGGCAG TGCTGTCCGG TCTGCTGCCG CTGTCCGACC GCGCATCGTC GCCCGCCGCC
GTCAGCGCCT GGCTGCGTGG GCTCATGCCC GAGGGGCGGG CGCGGAGCCA CCTCGCGCGT
CGTGCCGGTG TCGCCCCGGA CGACGTGGTC GGGTTCCTCG CCGTGCACGG GCGCGACACC
GCCGGCGCGC TCGTGCTCGT CCCCGAGGGC GCCTCACCGG ACCGTCCGCG CGTCCCGCTG
CGCACCCTCG ACGACGACGA GATCGGCGCC CTCCTCGACG AGGCCGCCGA GCAGGGCACG
GCGGACCAAC CGACGTCGAT CGCGGGGCTG GAGTCGAAGA TCGTCCTGAC CGCGACGGCG
CACGGCTTCG CGCTCCCGAC GCCCGACCGT CCGTCCACGC ACATCCTGAA GGTCGCCCGG
CCCGCCGACT CGCGGTCGGC CGACCTCACC GACACCGAGG AGGCGTCCCT CGCGCTCGCG
CGGGGGTGCG GGCTCGGCGA CGTCGAGGCG TGCCACCGCC TGTTCGCCGG CCGACGCGCG
CTCGTGGTGC GCCGGTACGA CCGCGTCGTC GGCCCGGACA CCACCGAGCG GGTCCACCAG
GAGGATGCCG CCCAGCTGCT GGGGCTGGAC ACGACCGACC CGGAGCGCAA GTTCCAGTAC
GGCAAGCGGC GGCCGTCCCT GCTGGAGATC GCCACCCGCC TCGAACGGCT CGGCGTCCCA
CTGGACGGCC TGCTCGCCCT GACGACGTTC AACGTGGCGA TCGGCAACAC CGACGCCCAC
GCCAAGAACC TGTCCGTGCT GCACCTTCCC GACGGCACCC ACCGGTTGGC GCCTGCCTAC
GACGTGGCGA TGCACACGCA CCATGCACAC GCCGAGACGC GCACCGCGAT GGACGTCGAC
GGCGTCCGAG AGATCGACGA CGTGACGTCC GAGCGCCTCC AGGCCGAGGC CGCCGCCTGG
GGGGTCGCCG CCCGCCTTGC CGCGCGGGTC GTGAGGGAAA CCCTCGAGCG ACTCGCCGCG
GCCCTCGACG ACGTGGACCG CGCCATGCAC CCCGGGGTCG ACAAGACGGC ATGGGCGACG
GTGGACGCGC GCGTCGCGCG CCTGCTCGGC TCCGGGTGA
 
Protein sequence
MSVRTLDAWL YGTLVAHIER DRDDRVRLRF TDDALDRWGH GSAVLSGLLP LSDRASSPAA 
VSAWLRGLMP EGRARSHLAR RAGVAPDDVV GFLAVHGRDT AGALVLVPEG ASPDRPRVPL
RTLDDDEIGA LLDEAAEQGT ADQPTSIAGL ESKIVLTATA HGFALPTPDR PSTHILKVAR
PADSRSADLT DTEEASLALA RGCGLGDVEA CHRLFAGRRA LVVRRYDRVV GPDTTERVHQ
EDAAQLLGLD TTDPERKFQY GKRRPSLLEI ATRLERLGVP LDGLLALTTF NVAIGNTDAH
AKNLSVLHLP DGTHRLAPAY DVAMHTHHAH AETRTAMDVD GVREIDDVTS ERLQAEAAAW
GVAARLAARV VRETLERLAA ALDDVDRAMH PGVDKTAWAT VDARVARLLG SG