Gene Phep_3587 details


Gene Information

Locus tag: Phep_3587
Symbol:
ID: 8254709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4269923
End bp: 4271593
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 43%
IMG OID: 644937239
Product: Na+/solute symporter
Protein accession: YP_003093840
Protein GI: 255533468
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
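
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4269923, 4271593)
print(length, protein_length(length))  # 1671 556
```

Under those assumptions the result agrees with the Gene Length (1671 bp) and Protein Length (556 aa) fields.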


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAGATA CAATAGTCGT AGTCGTTTTT TCTATTTTCA TCATGCTGGT AGGCATCAGT 
TTTTCCAAAA CGGGAAGAAA CTTAAAATCC TTTTTTGCAG GGGGCGAATC TGTTCCCTGG
TTTATTGGTG GTTTATCCTT ATTCATGAGT TTTTTTTCTG CAGGAACTTT TGTTGTCTGG
GGTGCAATTG CCTATCAACA CGGATGGGTA GCGGTTACTA TTCAATGGAC GATGTGTATC
GGCGCGTTAA TTACCGGTCT TTACCTTGCG CCAAGGTGGA AAGCCACCGG CAACCTTACC
GCAGCCGAGT TTATCAAGGC AAGACTGGGC AGCAATGTTC AGAAAAGCTA CATTTTTATA
TTCACCATTG TGTCAGTATT TATTAAAGGG TCAGTTCTAT ACTCTGTGGC CAAACTGGTT
TCTGGCTCAC TGGACTATCC GCTGATGCAG GTTACAGTTG TGCTGGGAAT TTTAATGATT
TCCTACACGG CTATAGGCGG CTTATGGGCA GTGATGGTTA CAGATATTTT ACAATTCGTA
GTGCTTACAG CTGCGGTGCT GATTATTTTG CCCATGGTAT TTAACGAGGT AGGTGGTGTG
CAGGGCTTTA TAGACAAAGC TCCGGATGAT TTTTTCAACC TGGTTCATGG CGAATATACC
TGGGGTTTTA TCTTTGCCTT TGCCATATAC CATATCTTTT ATATTGGTGG CAACTGGACA
TTTGTACAGC GCTATACCAG TGTAGATTCA CCTAAAAGTG CTTCAAAGGT AGCTTATCTT
TTTGCCGGCC TGTATATTTT AAGCCCTGTG CTTTGGATGT TGCCACCGAT GGCTTACCGG
GTTATTAATC CGGCACTTTC GGGACTGGAT GCCGAGAATG CCTACATTAT GGTTTGTAAA
CAGGTGCTTC CGGCAGGGCT CCTGGGCTTA ATGCTGACGG GTATGTATTT CTCCACTTCG
GCATCTGCCA ATACAGCACT TAATGTGGTA TCGGCAGTAT TTACCAATGA CATCTACAAA
GGATCTGTAA ATCCTGATGC CGATGATAAA AAACTAATGT TCGTAGCGCG GGCTTCTTCC
TGGTTTTTTG GTCTGCTGAT GATCGTGATT GCGCTGGGCG TTCCCTATAT AGGTGGTATT
GTTGAGTTTA CATTAAGTGT GGGGGCCCTA ACGGGAGGGC CGTTACTGCT GCCGCCAATC
TGGTCCCTGT TTTCAAAGCG CCTGTCCGGG AAAGCAACTA TTTATATCAC GTTGATATCA
CTTTCGGTAA ATGTGGTATT TAAGATGATC ATACCTTTTA TAGATGGTTA TAAATTATCA
AGGGCCAATG AAATGTTGGT GGGGGTTTTA CTGCCGTTTT TCATGTTGTT GATTTATGAG
ATCGTCAGAA GAAAGACTAA AGTAAGTCAG GATTACGAGA ACTACCTGGT ATATAAAGCA
GACAAGAAGC AGGCCGCTGT AGCTATTGAC GATGAAGAGC TGCAGATGAT AAAAAAGCAA
AATGTATTTG GTTTAAAAAT GATCAGCTTC TCCTTATTGT TTATGTCTGT TTTACTGTTG
CTGCTGGCTT TTATTACTTC AAAAGGAAAT GGGTTAGTGG TTATCATTGC TATTGTCATC
ATGTTTGGTG CAATTATTCC CTGGAGGGCT TCAAGACGTA AAGCGGCATG A
 
Protein sequence
MIDTIVVVVF SIFIMLVGIS FSKTGRNLKS FFAGGESVPW FIGGLSLFMS FFSAGTFVVW 
GAIAYQHGWV AVTIQWTMCI GALITGLYLA PRWKATGNLT AAEFIKARLG SNVQKSYIFI
FTIVSVFIKG SVLYSVAKLV SGSLDYPLMQ VTVVLGILMI SYTAIGGLWA VMVTDILQFV
VLTAAVLIIL PMVFNEVGGV QGFIDKAPDD FFNLVHGEYT WGFIFAFAIY HIFYIGGNWT
FVQRYTSVDS PKSASKVAYL FAGLYILSPV LWMLPPMAYR VINPALSGLD AENAYIMVCK
QVLPAGLLGL MLTGMYFSTS ASANTALNVV SAVFTNDIYK GSVNPDADDK KLMFVARASS
WFFGLLMIVI ALGVPYIGGI VEFTLSVGAL TGGPLLLPPI WSLFSKRLSG KATIYITLIS
LSVNVVFKMI IPFIDGYKLS RANEMLVGVL LPFFMLLIYE IVRRKTKVSQ DYENYLVYKA
DKKQAAVAID DEELQMIKKQ NVFGLKMISF SLLFMSVLLL LLAFITSKGN GLVVIIAIVI
MFGAIIPWRA SRRKAA
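
The gene and protein sequences in this record are related by translation under table 11. The following is a minimal sketch of that relationship and of the GC content calculation, not the database's own pipeline. It assumes that table 11 shares the standard codon-to-residue assignments for internal codons (it differs mainly in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed valid for table 11
# apart from alternative start codons, which this sketch ignores).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGATAGATA CAATAGTCGT AGTCGTTTTT"))  # MIDTIVVVVF
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.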