Gene Phep_2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2204 
Symbol 
ID8253310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2535979 
End bp2537664 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content46% 
IMG OID644935853 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003092470 
Protein GI255532098 
COG category[R] General function prediction only 
COG ID[COG4146] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0201601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACAC TCCACAAATA CGATTACATC GTATTTCTAG TTTATTTTTT AATAGTATCC 
AGCTACGGTT TCTGGATCTA TTATAAAAAG AAATCAGCTG CTGCCGATTC AAAAGATTAT
TTTCTTGCCG AGGGTTCGCT CACCTGGTGG GCCATTGGCG CCTCGCTGAT TGCCTCAAAT
ATCTCTGCCG AACAGATGAT CGGGATGAGT GGCTCCGGAT TTAAACTCGG ACTGGCCATT
TCGGCCTACG AATGGATGGC TGCAGCAACA TTGATCATTG TGGCCGTGTT CTTTATGCCG
GTTTACCTGA AGAACAAGAT CTTTACCATG CCCCAGTTTT TAAGTCAGCG TTACAACGAA
AAAGTAGCCA TGATCATGGC TGTATTCTGG CTCATGCTGT ATATAGTGGT TAACCTGATG
TCTATCCTGT ACCTCGGTGC ACTGGCCATC AGTGGTATAT CCGGGTTAAA CATCACGGCA
TGTATCCTTG GCCTGGCAGT TTTTGCCATC ATCATTACCC TGGGGGGGAT GAAGGTCATC
GGCTATACCG ATGTGATCCA GGTATTCTTC CTGGTACTGG GCGGTTTGGT AGCCACTTAT
ATTGCCCTCA ACCTGATCTC AGGTCAGCAG GGTATCGTAA AAGGCTTTGC CATCCTCACC
GATGGGGCCT CCGAGCATTT CCACCTCATC TTTAAAAAGG ACGATCCCAA TTATATGGAC
CTTCCGGGAC TCAGCGTATT GATTGGTGGT ATGTGGATTG CCAACCTGAG CTATTGGGGC
TGTAACCAGT ACATTACCCA AAGGGCACTG GGTGCATCCC TGCCCGTTGC CAGATCAGGC
CTTTTGTTTG CCGCCTTCCT TAAAATGCTG ATGCCGGTTA TTGTTGTTAT CCCAGGTATA
GCGGTATATT ATATCATTAA AGAGAAAATC CCGGGTATCA GCGGCAGCGA TCTGCTTACC
TCTTCAGGTG TACAGGATCC CAACAAAGCG TATCCGGCCT TACTGGGCTT ATTGCCCATC
GGTTTAAAAG GTTTGTCATT TGCCGCCTTA ACCGCTGCCA TCGTTGCTTC CCTTGCCGGA
AAAGCCAACA GCATTGCCAC TATATTTACA TTAGATATCT ATAAAAAAGC CTTCAATACC
AATGCTGGGG AAGGTACATT GGTCAACGTA GGGAAAATTA CGGTAATCGT ATCCATGCTG
CTGGCCGTAG TGCTTTCATT AATTGTTGGT GATGCCTTAA TGGGCGAGGG TAAACAGGGC
TTCCAGTACA TTCAGGAATA CACAGGTTTT GTGTCTCCGG GTATTTTTGC CATGTTCATC
CTGGGCTTTT TCTGGAAAAA GACCACTTCA AACGCAGCCT TATTTGCCAC GGTAGGCGGC
TTTATTGTTT CCGTGCTCCT TAAGTTCCTG CCGGGATGGG TAGATCTTTC CTTCTTACAT
GAATACGGAT GGGCAGTAGC CAATTCTGCA GGTGTATTTG AAATGCCATT TATGGACAGG
ATGCTGATCG TATTTGCCGT ATGTGTAATC GGTATGTATT TCATCAGTAT CTATGAAAAC
AGAAACGGCA TCATCCCTAA CGGACTGGAA GTAGATCCGA AAATGTTTCG CGTTTCCACT
TCATTTGCAG TAGGGGCATT GATTATTGTG GCCATGCTGG TGGCACTTTA TTCTGCCTTC
TGGTAA
 
Protein sequence
MNTLHKYDYI VFLVYFLIVS SYGFWIYYKK KSAAADSKDY FLAEGSLTWW AIGASLIASN 
ISAEQMIGMS GSGFKLGLAI SAYEWMAAAT LIIVAVFFMP VYLKNKIFTM PQFLSQRYNE
KVAMIMAVFW LMLYIVVNLM SILYLGALAI SGISGLNITA CILGLAVFAI IITLGGMKVI
GYTDVIQVFF LVLGGLVATY IALNLISGQQ GIVKGFAILT DGASEHFHLI FKKDDPNYMD
LPGLSVLIGG MWIANLSYWG CNQYITQRAL GASLPVARSG LLFAAFLKML MPVIVVIPGI
AVYYIIKEKI PGISGSDLLT SSGVQDPNKA YPALLGLLPI GLKGLSFAAL TAAIVASLAG
KANSIATIFT LDIYKKAFNT NAGEGTLVNV GKITVIVSML LAVVLSLIVG DALMGEGKQG
FQYIQEYTGF VSPGIFAMFI LGFFWKKTTS NAALFATVGG FIVSVLLKFL PGWVDLSFLH
EYGWAVANSA GVFEMPFMDR MLIVFAVCVI GMYFISIYEN RNGIIPNGLE VDPKMFRVST
SFAVGALIIV AMLVALYSAF W