Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Phep_2957 |
Symbol | |
ID | 8254068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pedobacter heparinus DSM 2366 |
Kingdom | Bacteria |
Replicon accession | NC_013061 |
Strand | + |
Start bp | 3526083 |
End bp | 3527303 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644936605 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_003093217 |
Protein GI | 255532845 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0772221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGAT TTGTAAAAAA TTATAGCGGC ATTATATGGT TGCTCACAGG AATTGCTTTA GGCAGTATTG CCGGATTAAT ATTCGGTAAA AAAGTAGAGG TATTAAAGCC AATAGGCGAT ATATTTCTAA ACCTCTTATT TACCGCTGTC ATCCCCCTGG TCTTCTTCGC CATTTCTTCT GCTATAGCCA ATATCAAACC TTCAGATAAA CTGAGCAGAA TGATGGGGTT TACCGCAATC GTATTTCTTG CCACGGTACT CATATCTGCC ATACTGACCA TTGTTGCCGT TAAAATATTC CCTATCCATG AAGCATTGGG CAATGCTCAG CTCAGCGAAA AAATAGAAGA CAGCCCCTTT GCCGAGCAAC TTACAAAATT ATTTACCACA ACAGAGTTTT ACGAACTGCT GTCACGCAAA AGCATGCTGG CGATGATCAT CTTTTCCATC ATGATAGGCT TTGCCGCATT AAAAGCCGGA AAGGCCGCAG ATCAGTTTGT TGGTTTCCTG CATTCCGGCA ACGAAGTGTT TAAAGGGGTA TTTATCCTGA TCATGAAAGC AGGCCCTATA GGTCTGGGTG CTTATTTTGC CTACCAGGTT GGTGTCTTCG GTCCACAATT GTTTGGCACC TATGCCAAGT CTTTGGGCCT GTACTATGGC TTTGGGGCTT TTTATTTCGT GGTCATGTTC AGTGTATACG CCTTTGTAGC GGGCGGCATC AAAGGCATCA GAAGGTACTG GAAAAATAAC CTTATTCCTT CTGCAACGGC AGTAGGCACC TGCAGCAGTA TTGCTGTTAT ACCTTCCAAC CTGGATGCCG CAAAAAAAAT GGGCATCCCT GAATACATCG CCAATGTAAC CATTCCCCTT GGCGCAACCC TGCATAAAGA TGGCTCCAGC ATCTCCTCTA TTGTTAAAAT GGCGGTTGTT TTTGCCTTAT TTGGTAAAGG CTTTGATACG GCCGATGCCA TTGTGCTTGC ACTGGGTATG ACAGTGCTGG TAAGTGTGGT AGAAGGCGGT ATCCCAAATG GTGGTTATGT AGGCGAACTG CTGTTCATCT CGGCCTACGG ACTACCGATT GAGGCCCTCC CTCCTGCAAT GATCATTGGC ACATTGGTTG ATCCCATGGC TACCCTGCTC AATGCTACCG GCGATACGGT TGCTTCTATG GTGGTAACCA GGTTTACCGA AGGCCGGCAA TGGATAGACA AGGCAATATA A
|
Protein sequence | MKGFVKNYSG IIWLLTGIAL GSIAGLIFGK KVEVLKPIGD IFLNLLFTAV IPLVFFAISS AIANIKPSDK LSRMMGFTAI VFLATVLISA ILTIVAVKIF PIHEALGNAQ LSEKIEDSPF AEQLTKLFTT TEFYELLSRK SMLAMIIFSI MIGFAALKAG KAADQFVGFL HSGNEVFKGV FILIMKAGPI GLGAYFAYQV GVFGPQLFGT YAKSLGLYYG FGAFYFVVMF SVYAFVAGGI KGIRRYWKNN LIPSATAVGT CSSIAVIPSN LDAAKKMGIP EYIANVTIPL GATLHKDGSS ISSIVKMAVV FALFGKGFDT ADAIVLALGM TVLVSVVEGG IPNGGYVGEL LFISAYGLPI EALPPAMIIG TLVDPMATLL NATGDTVASM VVTRFTEGRQ WIDKAI
|
| |