Gene Phep_3535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3535 
Symbol 
ID8254656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4203170 
End bp4204804 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content43% 
IMG OID644937186 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003093788 
Protein GI255533416 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.269906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGTA TTCTAAGCAA TATTTTTTCT TTGCTGCTCA TTTTAACTTT TGCAGCATGT 
AAAAACCAAT CTTCCAATGC AGAAAAAAAG GTTTTTAGCA TGAACCTTGA TCAAAATGTG
ACATCCCTTG ATCCCGCTTT TGCGCGCAAC CAGAATGCCA TCTGGATGAT CAACCAGATT
TTTAACGGAC TGGTACAGAT TGACAGTTCC TTAAATACCT TGCCCTGTAT TGCTAAAACA
TGGGAAATTT CGAAAGACGG CTTAACCTAC ACCTTTCACC TCCGCAATGA TGTTTATTTT
CATGACGACC CGATTTTTCC GGGTGGCAAA GGCCGTAAAG CTGTTGCAGC AGATTTCAGA
TACAGCTTTT ACAGGCTTAT AGACCCAAAA GTGGCTTCAT CCGGGGGATG GATCTTCAGC
GATAAGGTAA AAGACATCAA TAGCTTTATT GCGGTAAATG ACACGACTTT CCAGGTAAAA
CTCATGAAGC CTTTTCCTGC TTTTATGAAC CTGCTCACTA CGCAGTACTG CTCAGTGGTT
CCGAAAGAAA TTGTAGAGCA TTATGGTAAG GATTTCCGTA GTCACCCTAT AGGCACGGGT
CCGTTTAAAT TCAAGTACTG GAAAGAGGAT GAGATACTGG TGATGTTAAA AAACGAACAT
TACTGGGAAA AAGAAGGCGA CCGGCAATTG CCCTACCTGG ATGCGGTAAA AGTTACTTTT
ATTAGCGACA AGCAGAGTGC CTTCATGAAT TTCATTAAAA AGGACCTGGA TTATTTTGAT
AAGGTTGATG GCAGCTACCG TGACGATATT TTAACCAAAA GCGGAAAAAT GACCAGTAAG
TACAAAGGCA AATTCAAGCT CAGAAAAGGC CCTTATCTGT GCACAGAATA CGTAGGTATC
CTGGTAGACA CCTCGAAGGC CATTGCTAAA AATTCGCCCT TAAAATATAA AAAGGTAAGA
CAGGCCATTA ATTATGCGAT TGATAAACCC AAGCTGATCA AATATTTAAG GAACAGCATC
GGCATGCCGG CCACCTCTGG ATTTATCCCG CATGGCATGC CTGGTTTTGA CAGTACAGCG
GTTAAAGGTT ACCATTACGA ACCGCAAAAA GCAGCCCGCC TACTGGCCGA AGCTGGTTTC
CCGAACGGAA AGGGTATGCC GCAGATTACC CTCAGCACTT CTACCACTTA TAAAGACCTG
ATCGAATTTA TTCAGGGGGA GCTAAGTGCC ATAGGCATCA ATGTAAAGGT TGATGTAAGC
CCAAGTGCCA GTCTGCGCGA CCTGATCTCA AAAAATGGGG TAAACTTTTT CAGGGGCTCC
TGGATTGCCG ATTATCCGGA TGGCGAGAAC TACCTGGCCA TGTTCTATTC CAGGAACAAA
GTACCGAACG GCCCCAATTA TACAGGTTAT TTTAACGATG AGTTTGACCG TCTTTTTGAA
CAGAGCTATT ACGAAAGCGA CAACCAAAAG CGTTACCTGC TGTATCAGAA AATGGATAGA
ATGATCGTAG AATATGCCAA CGTAGTGCCG ATACTCTACG ATCAATCGCT TGTGATGACA
CAGAACAATA TCAGTGGCCT GCAGGTGAAT GCGTTAAACC TGATGATATT GAAGACTGTC
AGGAAGGAAA ATTAA
 
Protein sequence
MRRILSNIFS LLLILTFAAC KNQSSNAEKK VFSMNLDQNV TSLDPAFARN QNAIWMINQI 
FNGLVQIDSS LNTLPCIAKT WEISKDGLTY TFHLRNDVYF HDDPIFPGGK GRKAVAADFR
YSFYRLIDPK VASSGGWIFS DKVKDINSFI AVNDTTFQVK LMKPFPAFMN LLTTQYCSVV
PKEIVEHYGK DFRSHPIGTG PFKFKYWKED EILVMLKNEH YWEKEGDRQL PYLDAVKVTF
ISDKQSAFMN FIKKDLDYFD KVDGSYRDDI LTKSGKMTSK YKGKFKLRKG PYLCTEYVGI
LVDTSKAIAK NSPLKYKKVR QAINYAIDKP KLIKYLRNSI GMPATSGFIP HGMPGFDSTA
VKGYHYEPQK AARLLAEAGF PNGKGMPQIT LSTSTTYKDL IEFIQGELSA IGINVKVDVS
PSASLRDLIS KNGVNFFRGS WIADYPDGEN YLAMFYSRNK VPNGPNYTGY FNDEFDRLFE
QSYYESDNQK RYLLYQKMDR MIVEYANVVP ILYDQSLVMT QNNISGLQVN ALNLMILKTV
RKEN