Gene Phep_0525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0525 
Symbol 
ID8251612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp635288 
End bp636856 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content43% 
IMG OID644934175 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003090811 
Protein GI255530439 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.692758 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000137784 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGAACTCAT CACTGCATGT ATTAGATTAC ATCATTATTA TAGTATTTTT GATAGGAACG 
CTCGTGTTCG GACTGGTATT TGCCAGGGGA CAGAAAACGA CAAAAAATTA TTTCCTTGCG
AAAGGCAAGA TCCCTTCCTG GGCAATAGGG ATCTCGCTGC TCGCCACATT GATCAGCAGT
GTAACATTCC TGGCTTACCC GGGTACGGGA TATTCTTCTA ACTGGATCTT ACTGGTTCAG
GGCTTAATGG TACCTGTAGT GCTGCTTGGC GTAATCTGGT TTATTGTTCC CTTATACCGG
AAAGTGATCA ATTTGAGTAC TTACGAATAT TTTGAGCAGA GATTTGGTTC TTTTGCCCGA
TATTACAGTT CACTGGCCTT TGTGCTGAGG CAGTTTTCTG GTATGGGCAC AGTTTTTTTT
CTGCTTGCGG TGGCCTTAGG TAGTATGATC CATGTCAATA CCGCTATAAT TATTCTGGTT
GTGGGGGCTA TAATTATCAT TGTCAATTTA CTGGGAGGAA TTGAGGCAGT AATCTGGCTG
GATGTATTTC AGGGCTTTAT GCTTTTTGCC AGCGGGATCA TTTGCATCAG TATATTGCTC
TTTTCTGTAG ACGGAGGTCC GGCTGAAGTC TGGAAAATTG CTTCTGCTAA CGGCAGAACA
GGTTTTGGAC CTTATGAATG GGACCTTACC AAATTGACTT TTCTGGTGAT GGCTATAAAC
GGGGCTTTTT ATGCGGTACA GAAGTACGCA ACAGATCAGA CGGTGGTGCA GCGTTACCTG
ACTGCAAAAA CTGACCGGTC GGCCATCCGC GCATCACTGC TGGGTATCTT GTTAACCGTT
CCGGTATGGA TCTTGTTTAT GTTTATAGGA ACTGCATTGT TTGTGTTTTA TAAGCAAAAC
CCAATTCCGG CAGATATAAG ACCTGATGCT GTTTTCCCTT ATTTTATTAT GACCAAACTG
CCAACAGGTG TCATAGGGTT AATTCTTTCC GCAATGATTT CTGCAGCCAT CTGTAGTTTA
AGCGCCGATC TGAATTCTCT TGCTGCAGTG GGGGTAGAAG ACTATTATAA GAAATTAAGG
CCCGGCAAAA CAGATAAGGC TTATTTAAAG GCATCGAAAT ATATTGTTGC CTTATCTGGG
CTGATCTCTA TAGGAATAGC CATGTTGTAT CTGAATGCCG GAAATGAAGG GGTGCTGGGG
ATCGTATTTA CGCTGTACGC CATATTTTCA GGCGGCATTG TAGGTATGTT TTTACTGGGT
TTATTTAGTG CCAGGGCCAA TAATCAAGGA ATTACCATTG CCATTGTAGT CTGCATTCTT
TTTACGGCAT ATGCATTTTT AACTTCTACA GAAATCGGAA TTGGGGCAAA TAAATCGCTG
TTGTTAGATT TTGGTAAGTA TAACTTTACA CACCATAAGC TGATGCTGGG TGTATACAGC
CATCTCATCG TTATTGTTGT GGGTTATGTG GCCAGCTTAT TTTTTCCAAA ACCGGTTCTG
GATACCAATT TGCTTTATAG TGGCTGGCTG GCGGTTAGAC GGGAAGAAAG GGCAAGGGCA
GACAAATAG
 
Protein sequence
MNSSLHVLDY IIIIVFLIGT LVFGLVFARG QKTTKNYFLA KGKIPSWAIG ISLLATLISS 
VTFLAYPGTG YSSNWILLVQ GLMVPVVLLG VIWFIVPLYR KVINLSTYEY FEQRFGSFAR
YYSSLAFVLR QFSGMGTVFF LLAVALGSMI HVNTAIIILV VGAIIIIVNL LGGIEAVIWL
DVFQGFMLFA SGIICISILL FSVDGGPAEV WKIASANGRT GFGPYEWDLT KLTFLVMAIN
GAFYAVQKYA TDQTVVQRYL TAKTDRSAIR ASLLGILLTV PVWILFMFIG TALFVFYKQN
PIPADIRPDA VFPYFIMTKL PTGVIGLILS AMISAAICSL SADLNSLAAV GVEDYYKKLR
PGKTDKAYLK ASKYIVALSG LISIGIAMLY LNAGNEGVLG IVFTLYAIFS GGIVGMFLLG
LFSARANNQG ITIAIVVCIL FTAYAFLTST EIGIGANKSL LLDFGKYNFT HHKLMLGVYS
HLIVIVVGYV ASLFFPKPVL DTNLLYSGWL AVRREERARA DK