Gene Franean1_3201 details

Gene Information

Locus tag: Franean1_3201
Symbol: moaA
ID: 5671577
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3779991
End bp: 3780986
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 71%
IMG OID: 641242095
Product: molybdenum cofactor biosynthesis protein A
Protein accession: YP_001507515
Protein GI: 158315007
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2896] Molybdenum cofactor biosynthesis enzyme
TIGRFAM ID: [TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial
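
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues for an unspliced CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
gene_length = gene_length_bp(3779991, 3780986)
protein_length = protein_length_aa(gene_length)
print(gene_length, protein_length)
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields of the record.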


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTCG CCGACTCGTA CGGGCGAGTC GCGACCGACC TACGGGTGTC GCTGACCGAC 
CGTTGCAACC TGCGTTGCAC CTACTGCATG CCCGCCGAAG GCCTGGCCTG GCTGCCGCGG
CCGGAAATCC TCACGGACAG TGAGGTGTTA CGCCTGGTCG CGATTGCGGT GACAAGGCTC
GGGGTGACCG AGATCCGGTT GACCGGGGGC GAGCCCACAC TGCGGCCGGG GCTGGTGTCC
CTGGTGCAGG CGATAACCGC CCTGGTACCG CGGCCGGAGG TGGCGCTGAC GACGAACGGC
CTGCTTCTGG GCGGCTCCGG AGGGCTGGCC GGAGCACTGG CGGCGGCAGG CATGGACCGG
GTGAACGTGT CGCTGGACAC GCTGCGTCCG GACCGGTTTG GTGAGATCAC CCGCCGTCAC
CGCCTGGACG ACGTGTTCGC GGGTCTGGAA GCCGCGGCAC GGGCGGGATT CGCCCCGGTG
AAGGTGAACG CGGTGCTGAT GCGGGGAGTC AACGACGACG AGGCCGTCCC GCTACTGCAC
TGGTGCCTGG ACCGAGGCTA CGAGCTGCGG TTCATCGAGC AGATGCCGCT CGACGCCCAG
GGCGGCTGGC GACGGGAGCA GATGGTGACC GCGGCGGAGA TCTTGGACCG GCTGGCCGCG
GAGTTCACCC TCACCCCCGC ACCCGGGCGC GGCAACGCGC CGGCCGAACT TTTCACGATC
GACGCGGGCC CCGGGCAGGT CGGGGTGATC GCCTCGGTGT CGGCGCCGTT CTGCGCGGCG
TGTGACCGAG TCCGGCTCAC AGCTGACGGG CAGGTACGCG ACTGCCTGTT CGCCCGGACC
GAGTCGGATC TGCGGACCCC CCTGAGGTCT GGGGCTGACG ACGAGGAGAT CGCGGCCCGG
TGGGTGCGGG CGGTGCGGGC GAAACGGGCC GGGCACGGCA TCGACGTCCC CGGATTCGTT
CAACCGGCTC GTCCCATGTC CGCCATCGGC GGGTGA
 
Protein sequence
MQLADSYGRV ATDLRVSLTD RCNLRCTYCM PAEGLAWLPR PEILTDSEVL RLVAIAVTRL 
GVTEIRLTGG EPTLRPGLVS LVQAITALVP RPEVALTTNG LLLGGSGGLA GALAAAGMDR
VNVSLDTLRP DRFGEITRRH RLDDVFAGLE AAARAGFAPV KVNAVLMRGV NDDEAVPLLH
WCLDRGYELR FIEQMPLDAQ GGWRREQMVT AAEILDRLAA EFTLTPAPGR GNAPAELFTI
DAGPGQVGVI ASVSAPFCAA CDRVRLTADG QVRDCLFART ESDLRTPLRS GADDEEIAAR
WVRAVRAKRA GHGIDVPGFV QPARPMSAIG G
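
The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch in plain Python of how the GC content and the translation could be reproduced. It assumes the sense-codon assignments of translation table 11 match the standard genetic code (the two tables differ only in their permitted start codons) and that the gene starts with ATG, as this one does. The helper names `clean`, `gc_percent` and `translate` are illustrative.

```python
# Codon table laid out in the conventional TCAG order used by NCBI.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First display line of the gene sequence above.
first_line = "ATGCAGCTCG CCGACTCGTA CGGGCGAGTC GCGACCGACC TACGGGTGTC GCTGACCGAC"
print(translate(first_line))
print(round(gc_percent(first_line), 1))
```

The translation of the first 60 nucleotides corresponds to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would allow a comparison with the reported GC content and protein length.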