Gene Phep_2681 details


Gene Information

Locus tag: Phep_2681
Symbol:
ID: 8253789
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3141492
End bp: 3143006
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 45%
IMG OID: 644936329
Product: sulphate transporter
Protein accession: YP_003092944
Protein GI: 255532572
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
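
The coordinate and length fields above are related by simple arithmetic: with inclusive 1-based coordinates, the gene length is End bp minus Start bp plus one, and the protein length is one residue per codon, excluding the terminal stop codon. A minimal sketch of that check, assuming inclusive coordinates and a single stop codon; the function names are hypothetical and not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3141492, 3143006)
print(length, "bp")                  # 1515 bp
print(protein_length(length), "aa")  # 504 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.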


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.529175
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCAT TCCTGCACTT ATTTGATTTT TCGCAAAAAA TAAACTACAA AACAGAGGTC 
CTGGCGGGAT TGACAGTAGC CATGACCATG ATGCCCGAAT CCCTGTCTTT CGCTATACTT
GCCGGTTTAC CACCATTGTC TGGTTTGTAC GCAGCCTTTA TCATGGGTTT GGTAACCTCC
GTTTTTGGTG GCAGGCCAGG CTTAATCTCC GGAGGTGCAG GGGCAACTGT CATTGTACTG
ATCGCACTGA TGAAATCACA TGGCCTGGAT TATGTATTTG CCGCTGTCGC CATGGCAGGA
CTGTTTCAGT TGGCGGTAGG CTTGTTTAAA CTCGGAAAGT TTGTACGCCT CGTGCCACAG
CCTGTTATGT TTGGTTTTGT CAACGGCCTC GCCATCGTCA TTTTCATGGC ACAGCTGGAG
CAGTTTAAGG TAAATATAAA CGGAGAACTG GTCTGGATGA GCGGAAAAAC CCTGTACGTA
ATGCTGGGTC TGGTTACTTT AACCATTGCC ATCACCATTT TGTGGCCAAA AATAACCAAA
GCCATTCCCG CTTCCCTGGT AGCCATTCTT GCTGTTTTTG TGCTTGTATT GGGTTTTAAT
ATAGATACCA AAACGGTTAA AGATATAGCC GCAGTTGGCG GGGGCTTTCC TCCCTTTCAC
ATCCCGAAAG TCCCACTTAC TTTCGAAACC CTGCAGGTCA TATTTCCCTA TGCCTTAATT
ATGGCGGCTG TGGGCTTAAC CGAAGGACTT TTAACCTTAA ACCTGGTGGA CGAGATTACC
GGTACCCAGG GAAATGGAAA TAGAGAATGT CTGGCTCAGG GTAGCGCAAA CCTGCTTAAC
GGCTTTTTTT ATGGCATGGG GGGCTGTCCC ATGATCGCAC AGACACTGGT TAACCTGTCT
GCCGGAGCCA GAGCCCGTTT GTCTGGCATT ATTGCCGCTA TAACCATTCT GATCATTATT
TTATTTGGTG CTCCATTGAT AGAGAAAGTA CCTATGGCAG CCTTAACGGG TGTAATGATC
ATGGTAGCTA TTGGTACTTT TGAATGGATG AGTTTCAGGA TCATCAACAA AATGCCGCGG
CAAGATATTT TTGTAGGCAT TTTAGTTGCC CTGATTACCG TCTGGCTGCA CAACCTGGCT
TTGGCAGTGC TCATCGGTGT TATCATTTCT GCACTGGTTT TTGCCTGGGA AAGTGCCAAA
AGGATCAGGG CAAGAAAATA TACAGATGAA AACGGAGTAA AACATTATGA ACTGTATGGC
CCTTTGTTTT TTGGTTCAGT AGCCGGATTT ATGGAAAAAT TTGATATTAC CAATGATCCG
GAAAAAGTGG TCATCGATTT TCGGGACAGC AGAATTGCAG ATATGTCTGG CATAGAAGCC
CTGAACAAAC TTACTGAACG GTACAGAAAA ACGGGAAAAC AACTGCAACT GAAACATTTA
AGCAATGATT GCAGACTGCT ACTCAAAAAT GCCGATGGGG TGATTGCAGT CAATATTCTG
GAAGATCCGA CCTAA
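
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to clean the text and compute a GC percentage like the one in the GC content field. How the database itself computes and rounds that value is not stated, so this is an assumption, and the function names are hypothetical:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGAAACCAT TCCTGCACTT"))  # 40.0
```

Passing the full gene sequence text to `gc_percent` gives the figure to compare against the GC content field.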
 
Protein sequence
MKPFLHLFDF SQKINYKTEV LAGLTVAMTM MPESLSFAIL AGLPPLSGLY AAFIMGLVTS 
VFGGRPGLIS GGAGATVIVL IALMKSHGLD YVFAAVAMAG LFQLAVGLFK LGKFVRLVPQ
PVMFGFVNGL AIVIFMAQLE QFKVNINGEL VWMSGKTLYV MLGLVTLTIA ITILWPKITK
AIPASLVAIL AVFVLVLGFN IDTKTVKDIA AVGGGFPPFH IPKVPLTFET LQVIFPYALI
MAAVGLTEGL LTLNLVDEIT GTQGNGNREC LAQGSANLLN GFFYGMGGCP MIAQTLVNLS
AGARARLSGI IAAITILIII LFGAPLIEKV PMAALTGVMI MVAIGTFEWM SFRIINKMPR
QDIFVGILVA LITVWLHNLA LAVLIGVIIS ALVFAWESAK RIRARKYTDE NGVKHYELYG
PLFFGSVAGF MEKFDITNDP EKVVIDFRDS RIADMSGIEA LNKLTERYRK TGKQLQLKHL
SNDCRLLLKN ADGVIAVNIL EDPT
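
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which start codons it allows, so a standard codon table is sufficient for this gene, which begins with ATG. The following is a minimal sketch in plain Python; the `translate` helper is hypothetical, and it simply stops at the first stop codon:

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three and last two blocks of the gene sequence above.
print(translate("ATGAAACCAT TCCTGCACTT ATTTGATTTT"))  # MKPFLHLFDF
print(translate("GAAGATCCGA CCTAA"))                  # EDPT
```

The two printed fragments match the beginning and the end of the protein sequence listed above.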