Gene Franean1_3045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3045 
Symbol 
ID5671424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3578882 
End bp3579868 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content64% 
IMG OID641241943 
Productperiplasmic binding protein 
Protein accessionYP_001507363 
Protein GI158314855 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTTA CCAGACCACG GCGCCGCAGT GCGGCCCTGG TCGCCGCGTT GCTGGGCACC 
GTGATCCTCC TTGCGGGGTG CGGCAGCGAC GACAGCGACG ATCAAGGTGG CGCGGTCGGA
GCGACCCGCA CTGTGGAGGC CGACAACGGC GCGGTCGAGA TTCCGGCGCA CCCGCAGCGG
ATTGCAACGC TCGGGAGACT AACCGTGTCG TTCCTCGACC TCGGCGGCGA GCCAGTGGGC
GTCACGGAGG TGGACGCTTC CGTGCTCGAC GTACTGCCCG AGGAGCAGCA GGCCGCGTAC
AAGGCGGCCA AGCTCCTCGG CTCCGGCGCC AGCGAAGCCG ACCTCGAGCT GCTGGCCACC
CTCAAGCCCG ACCTCATCTT GTTCTCCGCA CCTGACTCCG ACTTCGAGCA GATGAAGTCG
CAACTGGAAT CGATCGCACC GACGATCTTC TTCGGATTCA GCTCGGACTG GAAGACCCGC
CTGTCCGTGA CCGCGGATGC CACTGAGTTG ACAGATGCTC TCAACGAGCA GAAGACCGAG
TATGAGGAGA AGCTCGCCGA GTTCCAGAAC AAGTACCCGG AGATCATAAA GACCACCAAG
TTCGGCGAAG TCAACAGAGG TTCTTGGCAG GACGCAGGAA TGTTCACCCT CAACGGCTCG
CAGTGCTCGG AGATAGCGCG AGCGGACATT CCCCTCGACA TACCCGATCT GGGCGAAGGG
GGCGAGGAGC GATCGTTCGA GCAGATCGGC GGCCTGTCCG AGTACGACGT GCTCCTGTAC
CCCGTGGACG CTGAGGGTAA GGTCACGGAA GGCTTCGCCC CCGTGGCGGA ATCGGGCGCA
TGGAAGGCCC TTCCCGCCGT GACCTCGGGC AAGGCCCTGG GTGTCTACTG CTTCGGCGAT
GTCAGCTTCA CCAGATCCTA TCGGACCTAC TCTCAATACC TGGATTCGCT CGGCCAGGCG
CTGGCGAAGC TCGCGACGGC GGGATGA
 
Protein sequence
MTFTRPRRRS AALVAALLGT VILLAGCGSD DSDDQGGAVG ATRTVEADNG AVEIPAHPQR 
IATLGRLTVS FLDLGGEPVG VTEVDASVLD VLPEEQQAAY KAAKLLGSGA SEADLELLAT
LKPDLILFSA PDSDFEQMKS QLESIAPTIF FGFSSDWKTR LSVTADATEL TDALNEQKTE
YEEKLAEFQN KYPEIIKTTK FGEVNRGSWQ DAGMFTLNGS QCSEIARADI PLDIPDLGEG
GEERSFEQIG GLSEYDVLLY PVDAEGKVTE GFAPVAESGA WKALPAVTSG KALGVYCFGD
VSFTRSYRTY SQYLDSLGQA LAKLATAG