Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0735 |
Symbol | |
ID | 5669151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 855528 |
End bp | 857192 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239662 |
Product | sulphate transporter |
Protein accession | YP_001505099 |
Protein GI | 158312591 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.725862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00858649 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGCCCCAGC CCGCCGTTCG CCGTGCCCGC CTTCCCCGGG CCCGGCCACG GCTGGCCGGT GCCCGCTGGG CGGCGCTGGC CGGCGTCCGG TGGGGCGTCC GGTGGCCGGG GCTGCGCGTG GTCCGCGTCG AGCTGCTGGC CGGCCTGGTC ACCGCGCTGG CGCTCATCCC GGAGACCATC TCGTTCTCGG TCGTCGCCGG GGTGGATCCC AAGGTCGGCC TGTTCGCGTC GTTCACGATC TCCGTCGTCA TCGCCTTCAC CGGCGGCCGC CCGGCGATGA TCTCGGCGGC GGCGGGCTCG ATGGCGCTCG TCGCGGCGCC GCTGGTCCGC GACCACGGCC TGGACTACCT GCTCGCGACC ACGGTCGGCG TCGGCGTGAC GATGTTCGTG CTCGGCCGGC TCGGGGTCGC GCGGCTGATG CGCTTCGTGC CGCGCACGGT GATGATCGGC TTCGTCAACG CGCTGGCCAT CCTGATCTTC ATGGCGCAGA TGCCGCACGT CATCGGCGAG TCGCTGCCGG TCTACCTGCT GGTCGGGGCC GGCCTGGCCA TCCTGCTCAT CGTCCCGCGG CTCACCCGGG CCGTGCCGCC GCCGCTGGTC GCGGTCGGCC TGCTGACCCT GACGACCGTC GCGTTCGACC TGTCGGTGCC GACCGTCGGC GACGAGGGCG CCCTGCCCGG CTCGCTGCCC GTCCCGGGGC TCCCGTCCGT CCCGATGACC TGGGAGACGC TGCGGATCAT CGCGCCGTAC GTCGCCACGC TCACCGCGGT CGGCCTGCTG GAGACCCTGC TCACCGCGCA GATCGTCGAC CGGATGACCG ACACCACGCA CGACGCCAAC CGCGAGGCGT GGGGCCTGGG CCTCGCCAAC GTCGTCAACG GCTTCCTCGG CGGCATGGCC GGCTGCGCCA TGATCGGCCA GACGATGGTC AACGTCTCCT CCGGCGGGCG GACGCGCCTG TCCACCTTCG CCGCCGGCGC CTTCCTGCTC CTTCTCGTGG TCCCGCTGCG GGGGATCGTC GGCGTGATCC CGATGTCCGC CCTGGTGGCG GTCATGATCC TGGTCTCGGT GATGACCTTC GAGTGGCGCA GCGTGGCGCC GTCCACGCTG CGGCGGATGC CGCGCTCGGA GACCCTGGTG ATGGTCGCGA CCGTCGCGGT GGTCGTGCCC ACCCACAACC TGGCGTACGG GGTGCTGGTG GGTGTCGTGC TCTCGGCGCT GCTGTTCACC CGCCGGGTCG CGCACCTCAC CGAGGTCACC AGCGTGCTCG ACCCGGAGGG GGGCGAGCGC CTCTACGCCG TGGGCGGCCC GGTCTTCTTC GCCTCCAGCA ACGACCTGGT GAACGCCTTC GACTACGCCC ACGACCCGCC CCGGGTCGTC ATCGACCTCG CCGACGCCCA GATCCTGGAC TCCGCCGGGG TGGCCGCGTT GGACGACGTC CTGCAGAAGT TCGCCGAGCG TGGCACCACC GTGCACCTGG CCGGCGCGAC CGGCGGCACC GAGGCGATGC TCGCCCGGCT CTCGACCGCG CCGCCGGACC GGGCCGCCGG CCGCGAACCG TGTGAACCGT GCCCGGAGGA ATCGCCTGGC GCAACGAAGA AATCCCGGCC CCGAACGGCC GACAGACCCG ATGCCGAAGC TGGGCCGAGC ACGCCCCGAC GGTGA
|
Protein sequence | MPQPAVRRAR LPRARPRLAG ARWAALAGVR WGVRWPGLRV VRVELLAGLV TALALIPETI SFSVVAGVDP KVGLFASFTI SVVIAFTGGR PAMISAAAGS MALVAAPLVR DHGLDYLLAT TVGVGVTMFV LGRLGVARLM RFVPRTVMIG FVNALAILIF MAQMPHVIGE SLPVYLLVGA GLAILLIVPR LTRAVPPPLV AVGLLTLTTV AFDLSVPTVG DEGALPGSLP VPGLPSVPMT WETLRIIAPY VATLTAVGLL ETLLTAQIVD RMTDTTHDAN REAWGLGLAN VVNGFLGGMA GCAMIGQTMV NVSSGGRTRL STFAAGAFLL LLVVPLRGIV GVIPMSALVA VMILVSVMTF EWRSVAPSTL RRMPRSETLV MVATVAVVVP THNLAYGVLV GVVLSALLFT RRVAHLTEVT SVLDPEGGER LYAVGGPVFF ASSNDLVNAF DYAHDPPRVV IDLADAQILD SAGVAALDDV LQKFAERGTT VHLAGATGGT EAMLARLSTA PPDRAAGREP CEPCPEESPG ATKKSRPRTA DRPDAEAGPS TPRR
|
| |