Gene Franean1_6869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6869 
Symbol 
ID5675182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8374316 
End bp8375245 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content75% 
IMG OID641245718 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_001511109 
Protein GI158318601 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.281308 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGTGA CCCGGCGTAG CTCTCCGCCG GCGTCCGGCC CGGTCGGTCC GGTCGATGCC 
CGCGGGGCGT CCGAGGCCGC GGTGCCAGCA ACGGCGCTGG CACCGACGGC GCTGGCCCCG
GCACCGACAC CGACAGCGCC GGCGCAGCGC AGCGGGGGAG CGAGCCTGCC CGGTCGGTAC
GACCGGCAGC TCGACCTGCC GGGCTTCGGC CCGGCGGCCC AGGAGCGGCT GCGCGGCGCG
ACCGTGCTCG TGGCGGGCGT GGGCGGCGTC GGCGGGGCGG CCGCTACCTA CCTCGCCGCC
GCCGGGGTCG GCCGGCTGGT CCTGGTCCAC CCGGGCGCAC TGGAGGAGCC CGACCTCAAC
CGGCAGACCC TGATGCGTCC CGAGTGGATC GGTGGCTCCC GGGTGGCCTG CGCGGAGCAG
ACCCTGCGCG CGCACCACCC GGGAGTCGAG ATCGTCGCGA TCGACCGCGA GCTGTCCCAG
CTCCCCGAGC TGGGCCGCCT CGTCGCCGAG GCGGACGTCG TCGTGGACGC CCGCCACAAC
TTCCCCGACC GCTACCTGCT CAACGACACC TGCGTGGCGG CCCGGACCCC GGCGGTCGTC
GCCGCGATGA ACGCGACCGA GGGCAACATG CTCGTCGTCC GGCCCGGGTC GCCGTGCCTG
CGCTGTGTCT TCACCGAGGG CGACCCGTCC TGGCAGCCGC TGGGTTTCAC CGTCCTCGGC
GCCGTGTCCG GGATGGTCGG CTGCCTCGCC GCGACCGAGG CGATAAAGAT CATATCCGGG
TTCGGTGAGC CCGCGGCCGG CAGGCTGCTC CAGTTCGACC TCTGGGACCT GGACTTCCAG
GTGCTGCGGG CCCGGCGCGA CCCGCACTGC CCGACCTGCG GCGGACCGTC CGGCAGCGGC
CAGCGACCCG GCCCCGGGGA GCCACAGTGA
 
Protein sequence
MIVTRRSSPP ASGPVGPVDA RGASEAAVPA TALAPTALAP APTPTAPAQR SGGASLPGRY 
DRQLDLPGFG PAAQERLRGA TVLVAGVGGV GGAAATYLAA AGVGRLVLVH PGALEEPDLN
RQTLMRPEWI GGSRVACAEQ TLRAHHPGVE IVAIDRELSQ LPELGRLVAE ADVVVDARHN
FPDRYLLNDT CVAARTPAVV AAMNATEGNM LVVRPGSPCL RCVFTEGDPS WQPLGFTVLG
AVSGMVGCLA ATEAIKIISG FGEPAAGRLL QFDLWDLDFQ VLRARRDPHC PTCGGPSGSG
QRPGPGEPQ