Gene Phep_2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2097 
Symbol 
ID8253202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2417202 
End bp2418794 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content45% 
IMG OID644935746 
ProductABC transporter related 
Protein accessionYP_003092364 
Protein GI255531992 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0494627 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACCC TACAGGATAT TACGTATACA CATCCCGACC GGGATGTATT GTTTAGGGGT 
TTAAACCTCA TCATTAACAA AAAAGACAAA ATTGCGCTTA TTGGCGACAA TGGCACAGGA
AAATCAAGCC TGCTAAATAT GATGGCAGGT AATTTACAAC CTGCCTCAGG AAGCGTTAAA
ATGAGTTCAA TCCCCTATTA TGTACCACAG ATTTTCGGAC AGTTTAATGA TTACAGCATT
GCAAAGGCGC TGCAGGTTGA AGGTAAATTA AAAGCCTTGA ATGAAATTCT GGATGGCCGG
ATGACGACTG AAAATATGGT TCTGCTGAAT GACGACTGGG CGGTTGAGGA ACGCTGCAAA
AATGCGCTTG CACATTGGAA CCTTGAGGGA CTGGACCTGG CCCAAAAAAT GGGCAGCCTT
AGTGGTGGCC AGAAAACCAA AGTTTTTTTA GCAGGGATAC GCATACACCG GCCTGAGATA
GTTTTACTTG ATGAGCCCAG TAACCATTTA GACCTATGGA GCAGAAAGCG GCTGTATGAT
GACCTTGCCT CGGCCACTAA CACGCTGGTG GTGGTAAGCC ATGATAAAAC CTTACTGAAG
CTTCCGGAAC GCGTTTTTGA ACTGGATAAG CGGGGCATAA CCGTGTATGG AGGCAATTAC
GATTTTTATG TGGCACAGAA AAAGCTGGAA AGCGAAGCGT TGAAACAGGA CCTGAACAGC
AGGGAAAAGG CACTTCGTAA AGCCAGGGAA ACTGAAAAAG AAGTACTGGA AAGGCAACAA
AAACTGGACG CCCGTGGCAA AAAGAAACAG GAAAAAGCGG GTTTGCCTAC TATTGTGATG
CATGCCTTTA AAAACAATGC AGAAAAAAGC AGCTCCCGTA TAAAAAGTGT CCATGAAGAT
AAAAAGGCTG TGCTTTCGCA GGAACTGGGC CAGTTGCGTG CAGCATTGCC TGACATCAAT
AAAATGAAAA TGGACCTGAA CAATTCTGCC CTGCACAGGG GAAAAACATT GGTCAGCGCA
AAAAATATAA ATTTTGGATA CTACCATCAG TTGCTTTGGA AGGAGCCTTT GAATTTTCGG
CTTAACAGTG GCGAACGTAT GGTCATCCGG GGCGCGAACG GGTCCGGTAA GACAACACTG
ATCAAAATGA TCTTAGGCTG GCTTCAACCC AGTTCAGGAA CATTAAACAG TCTTAGCGGC
ATTAAAACCA TTTACATTGA CCAGGATTAC TCGCTGATTG ACAACAACCT GAGTGTATAT
GAACAGGCAC AGGCCTATAA TTCGGGTGAA TTGCAGGAAC ATGAAATAAA GATCCGCTTA
AACCGCTTTT TATTTGACAA AGCCTACTGG AACAAATCCT GTGCAGCGCT GAGCGGCGGC
GAGAAAATGC GCCTGATGCT TTGCGCGCTA ACGATAAGCA ATGCCGCTCC CGACCTGATT
GTACTGGATG AACCCACAAA TAACCTGGAC ATTCAAAATA CAGGGATCCT GACTGCTGCA
ATTGGTGATT ATAAGGGAAC ACTGCTGCTG GTGTCTCATG ATGAGTTATT TTTGAAGCAG
ATAAATGCAG TACATTCAAT TGAGTTGCAT TAA
 
Protein sequence
MITLQDITYT HPDRDVLFRG LNLIINKKDK IALIGDNGTG KSSLLNMMAG NLQPASGSVK 
MSSIPYYVPQ IFGQFNDYSI AKALQVEGKL KALNEILDGR MTTENMVLLN DDWAVEERCK
NALAHWNLEG LDLAQKMGSL SGGQKTKVFL AGIRIHRPEI VLLDEPSNHL DLWSRKRLYD
DLASATNTLV VVSHDKTLLK LPERVFELDK RGITVYGGNY DFYVAQKKLE SEALKQDLNS
REKALRKARE TEKEVLERQQ KLDARGKKKQ EKAGLPTIVM HAFKNNAEKS SSRIKSVHED
KKAVLSQELG QLRAALPDIN KMKMDLNNSA LHRGKTLVSA KNINFGYYHQ LLWKEPLNFR
LNSGERMVIR GANGSGKTTL IKMILGWLQP SSGTLNSLSG IKTIYIDQDY SLIDNNLSVY
EQAQAYNSGE LQEHEIKIRL NRFLFDKAYW NKSCAALSGG EKMRLMLCAL TISNAAPDLI
VLDEPTNNLD IQNTGILTAA IGDYKGTLLL VSHDELFLKQ INAVHSIELH