Gene Franean1_6990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6990 
Symbol 
ID5675301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8512519 
End bp8513601 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content74% 
IMG OID641245836 
Productcobalamin synthesis protein P47K 
Protein accessionYP_001511227 
Protein GI158318719 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.81769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGTGC GGGTGCCCGT GATCGCGCTG ACCGGATACC TGGGTGCCGG CAAGACGACG 
GTGCTCAACC ACCTGCTCCA GGCCCCTGGG GCACGCCTTG GGGTCGTGGT CAACGACTTC
GGGGCGATCA ACGTGGACGC CGCGCTGGTC TCCGGTCAGG TGGACCAGCC GGCCTCGATC
GCGGGCGGCT GCCTGTGCTG CCTGCCGGAC ACGGACGGCC TGGACCAGGC GCTGGAGAAG
CTGAGCCATC CCCGGCTGCG GCTGGACGCG GTGATCGTGG AGGCCAGCGG CGTCGCCGAC
CCGCCGGCGC TGGCCAGGCT CATCCGGTTC AGCGGTGTGG ACCGCGTGCG CCCCGGCGGT
CTCGTCGACG TGATCGACGC CCCCGCCTAC TTCGACACCG TCGACACGGG CGGGCTGCCG
CCGGCCCGGT TCGCGTCCGC CTCGCTCGTC GTCATCAACA AGACCGACCG GATCCCGCCG
GCGCGGCGTG CCGAGACGTT GGCGCGGATC ACCGGCCGGG TGCGCGAGAG CAACCCGCAC
GCCCACATCG TCGACACGAC GCACGGCCGC GTCGACCCGG TGCTCGTGTT CGACGCCGCG
AACCCGTACG ACCCGGTCGA CGAGCTCCCG CTCGCGGCCC TGGCCCGGCA CGACCACGAA
GACGGTCACG ACCCGCACCC GCGGGTCGAC GCGGTGACCG TTCCCGCCGC CGGTCCGATC
GATCCCGGCC CGCTGGTCGA CCTGCTCGAG GATCCCCCCG CGAACGTCTA CCGGCTCAAG
GGCACCGTGA CCGTGGAGAC GGCGCGGGGA CCGCGCGGCT ATGTGGTCAA CGTCGTCGGA
CGGGAGATCA ACGTCGCGAC CAGACCCGGC GCTGTCAGAC CCGGCGCTGC CAGGCCCGGC
ACTGCCAGGC CCGGCACTGC CAGGCCCGCG GCGGACGATG CCAGCGGTCT GGTCGCGATC
GGCATGCGTC TCGACCAGGC CGCCGTCCGC GCCCGTCTCG AGGCGGCCCT CCAGCCGTGC
CCCGGTCGCC CCGCCGCGGA CGGGGTCCGC CGCCTCGCCC GCTACCGGCG CCTGAGCACC
TGA
 
Protein sequence
MVVRVPVIAL TGYLGAGKTT VLNHLLQAPG ARLGVVVNDF GAINVDAALV SGQVDQPASI 
AGGCLCCLPD TDGLDQALEK LSHPRLRLDA VIVEASGVAD PPALARLIRF SGVDRVRPGG
LVDVIDAPAY FDTVDTGGLP PARFASASLV VINKTDRIPP ARRAETLARI TGRVRESNPH
AHIVDTTHGR VDPVLVFDAA NPYDPVDELP LAALARHDHE DGHDPHPRVD AVTVPAAGPI
DPGPLVDLLE DPPANVYRLK GTVTVETARG PRGYVVNVVG REINVATRPG AVRPGAARPG
TARPGTARPA ADDASGLVAI GMRLDQAAVR ARLEAALQPC PGRPAADGVR RLARYRRLST