Gene Francci3_3475 details


Gene Information

Locus tag: Francci3_3475
Symbol:
ID: 3905209
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4144294
End bp: 4145337
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 72%
IMG OID: 637880797
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_482557
Protein GI: 86742157
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID:
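The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4144294, 4145337
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints: 1044 347
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption for this entry.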


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAAGG TAGTCGTGAC GGGTGGGGCC GGCTTCATCG GGGCCCACCT GACCAGAGCG 
CTCCTCGCGG CAGGCACGGA GGTCGTCGTG ATCGACGATC TCAGTACCGG GGCGCTGTCG
AATCTCGCGG GGCTGCCGGC CGAGCTCGTC GTCGGCAGCG TGACCGACCG GACGCTCGTC
GAACAGGCGT GCACCGGGGC GTCGAGCATC GTGCATCTGG CTGCCCGGCC GTCGGTCGAA
CGGTCGTTGC TCGACCCGAT GGCCACCCAC GCGGTGAACG CGACCGGCAC CCTGACGGTG
CTCGGCGTCG CCCACCGGGC CGAGACGCAC GTCGTCGTCG TGTCCTCCTC ATCGGTTTAT
GGTGACCGGT CGGCCGCTGG CGATCGGTCG GCCGCCGCTG GCGCGGGTCC GCTGTCGCCC
TCGGCGGGCA CCCCGTGCCT TCCGCGCAGT CCCTTCGCGG CCTCGAAGCT CGCCGCCGAG
GGATATGCGC TGTCCTACCA GGCCAGTTTC GGTCTGCCGG TGCTCACGGT CAGGCTGTTC
GACGTGTTCG GCCCGTACCA GTCCGCCGGG CACGCGTACG CGGCCGTGGT GCCGACCTTC
ATCGAGGCCG CGTTGGCCGG CCGGCCGCTG ACGGTGCGCG GGGACGGCCG GCAGACGCGC
GATTTCATCC CTGTCGAGCT GGTCACCGGG ATGCTGTGCG ATGCGGTGTG CCGCCGGCTG
ACCCATCCAC ACCCGGTCGA CATCGGGTCC GGGACCCGTA CCGATCTGCT CACCCTGATC
GCCCGGCTGG AGGAGATTCT CGGCCGGCGG CTGGTCGTCG AGCACGCTGC GCCCCGGCCC
GGGGAGATCT GGGACTCCCA GGCGGACACG ACGACGATGC GTTCGCTGTT CCCGGACGTG
ACCGGGGCGG ATCTCACCAC CGCGTTGGCG GCAACCGTGA CCTGGTACGC CGACCGGCTG
GGGGCGGACC GCGCCGGGCC GCCAGCCGCC GGGCCGCCGG ACGCGATGCT GCCCGCGTCC
GCCGTCCACG GTGACCGGGA TTGA
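The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation such as a GC content check. The sketch below is one way to do that in plain Python; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the short example string is just the first two blocks of the sequence above, not the full gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used only as a small illustration.
example = "ATGGTGAAGG TAGTCGTGAC"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))  # prints: 20 0.5
```

Applying the same two calls to the full block-formatted sequence would give a value that can be compared with the GC content field in the record.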
 
Protein sequence
MVKVVVTGGA GFIGAHLTRA LLAAGTEVVV IDDLSTGALS NLAGLPAELV VGSVTDRTLV 
EQACTGASSI VHLAARPSVE RSLLDPMATH AVNATGTLTV LGVAHRAETH VVVVSSSSVY
GDRSAAGDRS AAAGAGPLSP SAGTPCLPRS PFAASKLAAE GYALSYQASF GLPVLTVRLF
DVFGPYQSAG HAYAAVVPTF IEAALAGRPL TVRGDGRQTR DFIPVELVTG MLCDAVCRRL
THPHPVDIGS GTRTDLLTLI ARLEEILGRR LVVEHAAPRP GEIWDSQADT TTMRSLFPDV
TGADLTTALA ATVTWYADRL GADRAGPPAA GPPDAMLPAS AVHGDRD