Gene Franean1_5340 details


Gene Information

Locus tag: Franean1_5340
Symbol:
ID: 5673674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6437373
End bp: 6438449
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 69%
IMG OID: 641244198
Product: extracellular solute-binding protein
Protein accession: YP_001509604
Protein GI: 158317096
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
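
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below checks that relationship in Python, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both are assumptions, although they are consistent with the values listed in this record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(6437373, 6438449)
print(length, protein_length(length))  # prints: 1077 358
```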


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.927258
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCACACC GTCACGCGCG CACGCCAGAC CCGTCCGGTC TGTGCTCAGG GAAAGGGAGT 
CTGATGGGCA GGGCGAGAAG AGCGCTGGCC GTGCTGGCGA CGGCCATCCT CGTGGGGGGC
CTGTCGGCGG CGTGCGGCGG GGATGACGGG AAGACGCTCA CGCTCTACAA CGCGCAGCAT
CGGGACCTGA TGCAGGTGCT GGTCGACGCG TTCACCAAGG AGACCGGCAT CAAGGTCGAG
ATGCGTAACG GCGGTGACGC GGAGCTTGCG AACCAGATCG TCCAGGAGGG CGACAGCTCG
CCCGCGGATC TGTTCGCCAC CGAGAACTCG CCGGCCATGA CGCTGGTCGA CCGGGCTGGC
GGGTTCAGCC CGCTCGACCA GGCCACCCTC GACCAGATGC CCGACCAGTA CGTCCCGAGC
TCCGGCACCT GGGTCGGCTT CGCGGCGCGG TCGACGGTGT TCATCTACAA CCGCGACCAG
GTCGACAAGG ACGCGCTACC AACGTCGATC ATGGATCTGG CCCGGCCGGA GTGGCAGGGT
CGAGTGGGCG TCGCGGCCGG TGGCGCCGAC TTCCAGGCCA TCGTCAGCGC TGTGCTCGCG
GTGGAGGGCG AGGACGCTGC CGCCGACTGG CTCGCCGGAC TGAAGCGCAA CGCCAAGATC
TACGACAACA ACATCGCCGC GCTGCGTGCC GTGAACGCCG GCGAGGTGCC CGCCGCCGTG
ATCTACCACT ACTACTGGTA CCAGGACCAG GCGGAGTCGG GAGAGATCAG CAGGAACGTC
GACCTGCACT TCTTCGGGAA CCAGGACGCG GGCGCGTTCC TCAGCGTCTC CGGCGTCGGC
GTGATCGCGG CCAGCGACCA GCAGGCCGAG GCGCAGCAAC TGGTCAGGTT CCTCACCAGC
GAAGCCGGGC AGCGGGCGCT CGTCGACAGC GGCGCCCTGG AGTACGCCGT GTCGGACAAG
GCCCCCACAA ACCCCGCGCT GACGCCCCTG GCGGACCTCG ACGCACCGCA CATCGACATC
TCGACCCTGA ACGGCCCGAA GGTCATCGAG CTGATGCAGC AGGCGGGTCT GCTCTGA
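
The GC content field can be recomputed from a sequence laid out as above, in space-separated blocks of ten bases. This is a minimal sketch, not the database's own method. `gc_percent` is a hypothetical helper name, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 20 bases of the gene sequence only (15 of 20 are G or C).
print(gc_percent("GTGCCACACC GTCACGCGCG"))  # prints: 75.0
```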
 
Protein sequence
MPHRHARTPD PSGLCSGKGS LMGRARRALA VLATAILVGG LSAACGGDDG KTLTLYNAQH 
RDLMQVLVDA FTKETGIKVE MRNGGDAELA NQIVQEGDSS PADLFATENS PAMTLVDRAG
GFSPLDQATL DQMPDQYVPS SGTWVGFAAR STVFIYNRDQ VDKDALPTSI MDLARPEWQG
RVGVAAGGAD FQAIVSAVLA VEGEDAAADW LAGLKRNAKI YDNNIAALRA VNAGEVPAAV
IYHYYWYQDQ AESGEISRNV DLHFFGNQDA GAFLSVSGVG VIAASDQQAE AQQLVRFLTS
EAGQRALVDS GALEYAVSDK APTNPALTPL ADLDAPHIDI STLNGPKVIE LMQQAGLL
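
The protein sequence corresponds to the gene sequence read under translation table 11. That table uses the same amino-acid assignments as the standard genetic code but accepts alternative start codons, which is why the leading GTG appears as M. The sketch below illustrates this with the standard library only. `translate_cds` is a hypothetical helper, and the start-codon set is deliberately limited to the three most common bacterial starts, a simplification of the full table.

```python
BASES = "TCAG"
# Amino acids for the standard code in TCAG order; table 11 shares these assignments.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading a recognised first codon as M and stopping at a stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 20 bases of the gene sequence; the trailing partial codon is ignored.
print(translate_cds("GTGCCACACC GTCACGCGCG"))  # prints: MPHRHA
```

The output matches the first six residues of the protein sequence, and translating the final 15 bases of the gene (GCGGGTCTGCTCTGA) gives AGLL, matching its last four.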