Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_3029 |
Symbol | |
ID | 8254141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 3620746 |
End bp | 3622224 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644936678 |
Product | Na+/solute symporter |
Protein accession | YP_003093289 |
Protein GI | 255532917 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00934608 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCCGA TTATTCTTTT GTCCTTTCTT CTAGGCTATT TTGCTTTATT AGTTGGGGTG GCTTATTTCA CCTCAAGAAA CAACTCAGAC AACTCTTCTT TCTTCATTGC AAACAGAAAT TCCAAATGGT ACCTGGTCGC TTTTGGAATG ATAGGAACGG CATTATCCGG CGTAACTTTT ATATCGGTAC CCGGGGCTGT AGGCAAAAGT GATTTTGGTT ATTTTCAGTT CATACTGGGC AATGCAGTCG GCTTTATCAT TATTGCTACG GTATTGCTCC CATTATATTA CAGGTTAAAT ATCATTTCTA TTTATACCTA TCTTGAAAGA CGTCTGGGTT TCTGGAGTTA TAAAAGCGGT GCCGTAATTT TCCTGGTATC CCGTACTATT GGGTCTGCTT TTAGGCTTTA CCTTGTTGCT ATTGTGTTGC AGAAGTTCAT TTTTGATGCC TGGAGTATAC CTTTCTGGTT AACTATAGTT ATCTGCCTGG TACTCATCTG GCTGTACACA CACAAAGGAG GATTAAAAAC TATTATCATT ACCGATACCT TGCAAACAGT ATTCCTTTTG CTATCTGTAG TGCTATCAAT CATATTTATT GCCAGATCTT TAAACCTGGA TATTGCCGGT ACTTTCGAGG CTGTAAAAAA CAGTAGCTAT TCCAAAATCT TTTTCTGGGA GGATTTTCTG GGTAGTAAAA CGCATTTCCT CAAGCAGTTT TTTGGGGGTA TTTTTGTAAC CATTGCCATG ACCGGACTGG ATCAGGATTT GATGCAGAAA AACCTAAGTA TGAAAACCAT TGGCGAAGCA CAAAAAAATA TGTTCACCTT CACCACCGTT TTTGTGATCA TGAATATCTT CTTTTTAAGT GTAGGTGCTT TGTTATATCT TTTCGCTGCA AAAAACGGAA TTGATGTTGC AGCATTAAAA ACACCTGACC ACCTTTATCC GGAAATTGCA TTAAACCACC TGAATGTCAT ACCAGGCATT ATTTTTATGC TGGGATTAAC CGCCGCGACA TTTGCGACCA CTGATTCGGC ATTAACGGCA TTAACAACTT CATTTTGTGT AGATTTTTTA CATTTCGATA AAAAAGCCGA TCAGAATGAT CCTGCACTGG TTAGCAAACG GCACTTGGTG CACATTGGAT TCTCTGTGCT GATGGTCGTT GTGATCATGA TCTTTAAAAT CATCAACGAT GACTCGGTAG TAAATGCCAT TTTTAAAGCA GCCGGGTATA CTTACGGACC ATTGGTAGGT CTTTTTGGTT TTGGAATGCT GACTAAAAAG GCTGTAACCG ACAAGCTTGT ACCTTATATC TGTATACTTT CACCAATATT GTGCTTTATA ATTGATATCA ATTCTTTAAA CTGGTTTGGA TACGCTTTAG GTTTTGAACT GATCATATTA AATGGATTAC TTACATTTGT TATGCTTTGG ATTACTGGTA AAACATCAAC AACCCAAACC AAATTCTAA
|
Protein sequence | MSPIILLSFL LGYFALLVGV AYFTSRNNSD NSSFFIANRN SKWYLVAFGM IGTALSGVTF ISVPGAVGKS DFGYFQFILG NAVGFIIIAT VLLPLYYRLN IISIYTYLER RLGFWSYKSG AVIFLVSRTI GSAFRLYLVA IVLQKFIFDA WSIPFWLTIV ICLVLIWLYT HKGGLKTIII TDTLQTVFLL LSVVLSIIFI ARSLNLDIAG TFEAVKNSSY SKIFFWEDFL GSKTHFLKQF FGGIFVTIAM TGLDQDLMQK NLSMKTIGEA QKNMFTFTTV FVIMNIFFLS VGALLYLFAA KNGIDVAALK TPDHLYPEIA LNHLNVIPGI IFMLGLTAAT FATTDSALTA LTTSFCVDFL HFDKKADQND PALVSKRHLV HIGFSVLMVV VIMIFKIIND DSVVNAIFKA AGYTYGPLVG LFGFGMLTKK AVTDKLVPYI CILSPILCFI IDINSLNWFG YALGFELIIL NGLLTFVMLW ITGKTSTTQT KF
|
| |