Gene Franean1_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0721 
Symbol 
ID5669137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp838110 
End bp839147 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content72% 
IMG OID641239648 
Productalkanesulfonate ABC transporter 
Protein accessionYP_001505085 
Protein GI158312577 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.587353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0488805 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCATC GACGGACCCG TGGGCGGCGG CTGGCGCTCG CGTCCGCCGG CCTGGCGGCC 
CTGCTGACGG TCGGCCTCGC CGCGTGCGGC GGCGGCGACT CGGACCCGGC CGCCCCCGCG
TCCGGCGGAA AGGCCGGCAC GCTGCGGATC GGCGACCAGA GTAAGTCCCT GGAGCTCCCG
ATCACGCTGT CGGGCGCGGG CCGCGACACC CCCTACAAGC TGAACTGGAA CAACTTCGCC
GACGGGCCGC ACATGAACGC CGCGTTCAGC GCGGGCCGGC TCGACGTCGG TTTCATGGGC
GACACGCCGG TCCTGTTCGC GAACGCCGCG GACGCCGGAG TGGTCGCCGT GGCCGTGGCG
GAGAACCGGG TGAACAGCCA GACCATCTTC GCCTCCGCCG GCTCCGGCAT CCACAGCCTC
GCCGACCTGA AGGGGAAGCG GGTCGCGTTC ACCCGGGGGA CCTCCCTGCA CGGCTATCTC
CTCAACCAGC TCGACTCGGT CGGGCTCACC CAGGACGACG TCACCCCGGT CAACGTCCCG
GCGGCGAGCC TGCCCGCCAC CTTCTCCTCC GGAGCGGTGG ACGCCGTGGT GTACGTCCGC
CAGTTCGGCG CGGCGGTCAC CGCGCAGAGC GCCGGGTCCT ACGAGGTGGA GACCAAGCCG
CTGCCGCAGT ACTCGGTGCT GCTCGCGGCG AAGGACGCGC TGGCGGACCC GGCCCACCGC
GAGGCGGTGC GGGACTTCGT GCTCCGCCTC TCCCGGGCCT CGGCGTGGCC CAAGCAGAAC
CCGGACGAGT GGATCCAGAA GTACTACGTG GAGACGCTCA AACAGGATCC GGTCGCCGCG
CGGAAGTACT TCGAGAGCCT GCCGGAGAGC AGGTACACCC CGGTCACGGC GGCGTTCGTC
GACAGCCAGC GGACGCAGGC GAAGCTGCTC GTCGACGTCG GGGAGCTGCC CCCGTCGCTG
AACGTCGACG ACGAGGTCGA CAAGGCGTTC ACCACCGAGC TGACCGCGGC GTTCACCGCC
GCGAGCCTGC CGACATGA
 
Protein sequence
MHHRRTRGRR LALASAGLAA LLTVGLAACG GGDSDPAAPA SGGKAGTLRI GDQSKSLELP 
ITLSGAGRDT PYKLNWNNFA DGPHMNAAFS AGRLDVGFMG DTPVLFANAA DAGVVAVAVA
ENRVNSQTIF ASAGSGIHSL ADLKGKRVAF TRGTSLHGYL LNQLDSVGLT QDDVTPVNVP
AASLPATFSS GAVDAVVYVR QFGAAVTAQS AGSYEVETKP LPQYSVLLAA KDALADPAHR
EAVRDFVLRL SRASAWPKQN PDEWIQKYYV ETLKQDPVAA RKYFESLPES RYTPVTAAFV
DSQRTQAKLL VDVGELPPSL NVDDEVDKAF TTELTAAFTA ASLPT