Gene Franean1_5939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5939 
Symbol 
ID5674260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7215934 
End bp7216926 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content67% 
IMG OID641244787 
Productperiplasmic binding protein 
Protein accessionYP_001510189 
Protein GI158317681 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.587353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.774281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGGGTA TAAGAAAATC GCTCGCGGCG ATATGCGTCG CGGCGGCCGT CATGGTCGGA 
ATCGTGGCTT GCGGCAGCGA TGACGACGGG GGCGCGTCCC AGGAGGGGGG GACCGCGGCG
GCGGCCGGGT TCCCGGTCAC GATCAAGCAT GCGTTCGGAT CGACGACCAT TCCCGCGGAG
CCGAAGAAAA TCATCGCGCT GAGCTACGAG GAGGACACGC TCGCGGCGCT CGGAATCACG
CCGATCGCCT ACGGTCAGAA CCCCGGCAAG CCGGACGGGG ACTTCCCCTG GCTCGAGGGC
AGGATCGACC TCGCGGACAG CACGGCGCTC GACACCAGCG GCGACCTCAA CCTCGAGCAG
ATCGCCGCAC TCGATCCCGA CCTGATCCTC GCGACGAACT TCTACGGGCT GGCGGACTAC
TACGACCGCC TCAGCGAGAT CGCGCCGACC GTCGCCTACG AGACCGACGC CGGTATCTCG
ACCTGGCAGG ACGTGAGCAC GGTGATCGGG AAGGCGGTTG GTCGTGAGGC CGACGTCGCG
AAGGCGATCG AAGCCACCGA GAAGATCGTC TCCGACGCGG CCGCGGAGCT GCCCGGGCTG
GCGGGCAAGA CGTTCAGCTA CAGCTACTAC TACGAGTCCA ACGGGCTCGC CGTGATCGAC
GACCCGGAGA CGATCTCCAT CCAGCTCTAC GGCCAGCTCG GGATGAAGCT GTCGCCGCGG
GTCACCGCGA GCGTCGTCGA CCGCGCGCTG AGCATGGAGA AGATCGGTGA GCTCGACGCC
GACTTCATGA TGATCGGCTT CGCGACCGAC GAACTGCGCA CCGAGATGAA GGCCAACGAG
CTGTACACGA GGATCCCGGC GGTCATGGAC GGCCGGGCGC AGGAGGTCGA CGCCTTCACG
GCGGGTGCGG TCAACAACCC GACCATCCTG AACATCCCCT GGCAGCTCGA GCAGCTCAAG
CCCACCCTGG CCAAGGTCGC GGCGGCGGGC TGA
 
Protein sequence
MRGIRKSLAA ICVAAAVMVG IVACGSDDDG GASQEGGTAA AAGFPVTIKH AFGSTTIPAE 
PKKIIALSYE EDTLAALGIT PIAYGQNPGK PDGDFPWLEG RIDLADSTAL DTSGDLNLEQ
IAALDPDLIL ATNFYGLADY YDRLSEIAPT VAYETDAGIS TWQDVSTVIG KAVGREADVA
KAIEATEKIV SDAAAELPGL AGKTFSYSYY YESNGLAVID DPETISIQLY GQLGMKLSPR
VTASVVDRAL SMEKIGELDA DFMMIGFATD ELRTEMKANE LYTRIPAVMD GRAQEVDAFT
AGAVNNPTIL NIPWQLEQLK PTLAKVAAAG