Gene Franean1_0735 details


Gene Information

Locus tag: Franean1_0735
Symbol:
ID: 5669151
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 855528
End bp: 857192
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 73%
IMG OID: 641239662
Product: sulphate transporter
Protein accession: YP_001505099
Protein GI: 158312591
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
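
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(855528, 857192)
print(length_bp)                  # 1665
print(protein_length(length_bp))  # 554
```

Both values agree with the Gene Length and Protein Length fields of this record.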


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.725862
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00858649
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCCCCAGC CCGCCGTTCG CCGTGCCCGC CTTCCCCGGG CCCGGCCACG GCTGGCCGGT 
GCCCGCTGGG CGGCGCTGGC CGGCGTCCGG TGGGGCGTCC GGTGGCCGGG GCTGCGCGTG
GTCCGCGTCG AGCTGCTGGC CGGCCTGGTC ACCGCGCTGG CGCTCATCCC GGAGACCATC
TCGTTCTCGG TCGTCGCCGG GGTGGATCCC AAGGTCGGCC TGTTCGCGTC GTTCACGATC
TCCGTCGTCA TCGCCTTCAC CGGCGGCCGC CCGGCGATGA TCTCGGCGGC GGCGGGCTCG
ATGGCGCTCG TCGCGGCGCC GCTGGTCCGC GACCACGGCC TGGACTACCT GCTCGCGACC
ACGGTCGGCG TCGGCGTGAC GATGTTCGTG CTCGGCCGGC TCGGGGTCGC GCGGCTGATG
CGCTTCGTGC CGCGCACGGT GATGATCGGC TTCGTCAACG CGCTGGCCAT CCTGATCTTC
ATGGCGCAGA TGCCGCACGT CATCGGCGAG TCGCTGCCGG TCTACCTGCT GGTCGGGGCC
GGCCTGGCCA TCCTGCTCAT CGTCCCGCGG CTCACCCGGG CCGTGCCGCC GCCGCTGGTC
GCGGTCGGCC TGCTGACCCT GACGACCGTC GCGTTCGACC TGTCGGTGCC GACCGTCGGC
GACGAGGGCG CCCTGCCCGG CTCGCTGCCC GTCCCGGGGC TCCCGTCCGT CCCGATGACC
TGGGAGACGC TGCGGATCAT CGCGCCGTAC GTCGCCACGC TCACCGCGGT CGGCCTGCTG
GAGACCCTGC TCACCGCGCA GATCGTCGAC CGGATGACCG ACACCACGCA CGACGCCAAC
CGCGAGGCGT GGGGCCTGGG CCTCGCCAAC GTCGTCAACG GCTTCCTCGG CGGCATGGCC
GGCTGCGCCA TGATCGGCCA GACGATGGTC AACGTCTCCT CCGGCGGGCG GACGCGCCTG
TCCACCTTCG CCGCCGGCGC CTTCCTGCTC CTTCTCGTGG TCCCGCTGCG GGGGATCGTC
GGCGTGATCC CGATGTCCGC CCTGGTGGCG GTCATGATCC TGGTCTCGGT GATGACCTTC
GAGTGGCGCA GCGTGGCGCC GTCCACGCTG CGGCGGATGC CGCGCTCGGA GACCCTGGTG
ATGGTCGCGA CCGTCGCGGT GGTCGTGCCC ACCCACAACC TGGCGTACGG GGTGCTGGTG
GGTGTCGTGC TCTCGGCGCT GCTGTTCACC CGCCGGGTCG CGCACCTCAC CGAGGTCACC
AGCGTGCTCG ACCCGGAGGG GGGCGAGCGC CTCTACGCCG TGGGCGGCCC GGTCTTCTTC
GCCTCCAGCA ACGACCTGGT GAACGCCTTC GACTACGCCC ACGACCCGCC CCGGGTCGTC
ATCGACCTCG CCGACGCCCA GATCCTGGAC TCCGCCGGGG TGGCCGCGTT GGACGACGTC
CTGCAGAAGT TCGCCGAGCG TGGCACCACC GTGCACCTGG CCGGCGCGAC CGGCGGCACC
GAGGCGATGC TCGCCCGGCT CTCGACCGCG CCGCCGGACC GGGCCGCCGG CCGCGAACCG
TGTGAACCGT GCCCGGAGGA ATCGCCTGGC GCAACGAAGA AATCCCGGCC CCGAACGGCC
GACAGACCCG ATGCCGAAGC TGGGCCGAGC ACGCCCCGAC GGTGA
 
Protein sequence
MPQPAVRRAR LPRARPRLAG ARWAALAGVR WGVRWPGLRV VRVELLAGLV TALALIPETI 
SFSVVAGVDP KVGLFASFTI SVVIAFTGGR PAMISAAAGS MALVAAPLVR DHGLDYLLAT
TVGVGVTMFV LGRLGVARLM RFVPRTVMIG FVNALAILIF MAQMPHVIGE SLPVYLLVGA
GLAILLIVPR LTRAVPPPLV AVGLLTLTTV AFDLSVPTVG DEGALPGSLP VPGLPSVPMT
WETLRIIAPY VATLTAVGLL ETLLTAQIVD RMTDTTHDAN REAWGLGLAN VVNGFLGGMA
GCAMIGQTMV NVSSGGRTRL STFAAGAFLL LLVVPLRGIV GVIPMSALVA VMILVSVMTF
EWRSVAPSTL RRMPRSETLV MVATVAVVVP THNLAYGVLV GVVLSALLFT RRVAHLTEVT
SVLDPEGGER LYAVGGPVFF ASSNDLVNAF DYAHDPPRVV IDLADAQILD SAGVAALDDV
LQKFAERGTT VHLAGATGGT EAMLARLSTA PPDRAAGREP CEPCPEESPG ATKKSRPRTA
DRPDAEAGPS TPRR
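
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the standard codon table (translation table 11 uses the same amino acid assignments as the standard code) and, as an assumption made here, decodes an initial ATG, GTG or TTG as methionine, which is how the TTG at the start of this gene yields the leading M. The `translate` helper is hypothetical and only the first 60 bases of the gene sequence are used.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these three common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_60 = "TTGCCCCAGC CCGCCGTTCG CCGTGCCCGC CTTCCCCGGG CCCGGCCACG GCTGGCCGGT"
print(translate(first_60))  # MPQPAVRRARLPRARPRLAG
```

The output matches the first 20 residues of the protein sequence listed above.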