Gene Franean1_1966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1966 
Symbol 
ID5670367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2362451 
End bp2363407 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content68% 
IMG OID641240887 
ProductABC-2 type transporter 
Protein accessionYP_001506309 
Protein GI158313801 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1682] ABC-type polysaccharide/polyol phosphate export systems, permease component 
TIGRFAM ID[TIGR00025] ABC transporter efflux protein, DrrB family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.115517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG CCGAGACGAC CACCGCCGGA ACAGGCACCG CCGGAACAGG CACCGCCGGA 
ACAGCCACCA CCAGCGCAGG CACCGCCGGA ACAGGCGCCA CGGGCACGGG CTCCGGTGCG
GACACAAGGC CGCCGGCGGC CGGGGCGGCC GTGCCGCCGG ATCTGCGCGC GGTGCTCGCG
ACCGGTGCCC GCCCGGCGCG GCCCACCCCG CTGGCGGCCT CGCTCACCTT CGACTGGCGG
GCCCTGCTGA AGATCAGACA TGTGCCCGAG CAGCTCTTCG ACGTGACCGT CTTCCCGATC
ATGTTGACCC TGATGTTCAC CTATCTGTTC GGCGGCGCGC TCGCCGGGTC GACGCAGGAG
TACGTACAGT TCCTGCTGCC CGGAATCCTC GTCCAGGCGA TCGTGATGAT CACGGTTTAC
ACCGGGGTGA CCGTCAACAC CGACATCACC AAGGGTGTCT TCGACCGGTT GCGGTCACTG
CCGATCTGGC AGCCGTCCGC ACTCGTCGGA GCGCTGCTGG GCGACGTGTT CCGCTATTCG
ATCGCCGCCG TCCTCATCCT CGCGCTGGGG CTGGCGATCG GTTTCCGGCC GGAGGGGGGC
GCGCTCGGCG TCCTGGCCGC GGTGGCCGTC GTCATCGCCT TCTCGTTCAG CCTCACCTGG
GTGTGGACGG TGCTGGCGAT GGTGCTGCGC ACCCCGAACT CGGTGATGGG CGTGAGCATG
ATGATTCTTT TTCCACTGAC CTTCGTCAGC AACATCTTCG TGCGGCAGGA GACGCTGCCC
GGGTGGCTCC AGGCCTTCGT CGACGTCAAC CCGATCACCC ACACGACGAA CGCCTCCCGC
GGGCTGATGC ACGGCGTCGC CACGGCTGAG CAGCTCGGAT GGGTCGCGCT GTCGTGCGCG
CTTCTGCTCA TCGTGTTCGG CCCCCTGACG ATGCGGATGT ATCGCGGCCG GAGTTGA
 
Protein sequence
MTAAETTTAG TGTAGTGTAG TATTSAGTAG TGATGTGSGA DTRPPAAGAA VPPDLRAVLA 
TGARPARPTP LAASLTFDWR ALLKIRHVPE QLFDVTVFPI MLTLMFTYLF GGALAGSTQE
YVQFLLPGIL VQAIVMITVY TGVTVNTDIT KGVFDRLRSL PIWQPSALVG ALLGDVFRYS
IAAVLILALG LAIGFRPEGG ALGVLAAVAV VIAFSFSLTW VWTVLAMVLR TPNSVMGVSM
MILFPLTFVS NIFVRQETLP GWLQAFVDVN PITHTTNASR GLMHGVATAE QLGWVALSCA
LLLIVFGPLT MRMYRGRS