Gene Phep_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1930 
Symbol 
ID8253034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2231564 
End bp2233351 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content43% 
IMG OID644935581 
Producthypothetical protein 
Protein accessionYP_003092200 
Protein GI255531828 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.171186 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0291599 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAA AACTTACGCT CATCCTGTCC ATCCTTACCG GATACCTGCA AGCCCAGATG 
CCACTTGCCC CTTCTAAACA CAATTTAGAA AAAAGAGTAA GTATAGAAGT AAAAAATACC
CAGATATCAG AAGTACTCAC CAGGGTTAGC CGCGCCGGGG CATTCTATTT CTCTTACAGC
GGGGCCTTGT TCACAACAGA TAGCCTGGTT AGTCTGAATG TCAGAAATAC GCCGGTAAGG
GAAATATTGG ACAGGCTTTT TAACAATAAG GTAGATTACA AGGAAAACGG GGAATATATC
ATTCTCCGTT ATGCAGCAAA CCACCTTACC ATAGAACCTG AAAACATCAC CACTGCCGAC
AAGCTTTACC TGATCAGTGG TTATGTAATA GATACAGAAA CGGGTCGTAA GGTTAAACAG
GCCAGTGTTT ATGAAAAACG TCTCTTGCAG TCTACACTTA CAGATCATGA GGGCTTCTTT
AAACTGAGGT TTAAAGGTGA CCATAATGAA GTGATATTAA CAGCTGCCAA AGAAAACTAC
AGAGACACCA CGCTCGTCTT CCTATCTGAC ATTAAAGTGA AACCAGAGGG TTATAAAGAC
CCAAACGCAG ATGAAGCAAA TGGTCTGTTC AGTGATGTCG AAAATTCGGG AATCGGCAGG
TTCTTCATTT CTTCCAAACA AAGGATCCAG AGTTTAAACA TCCCCAGTTT TTTTGCCAAC
AGTCCTTTTC AAACTTCACT TACACCGGGT TTAAGCTCCC ACGGCATCAT GAGCTCCCAG
GTGGTCAACA AATTTTCACT GAATGTTCTG GGTGGTTATA CCGCAGGCAC AGATGGACTT
GAAATTGCAG GGCTTTTTAA CATTACCAAA GGTGATGTGA AGAAATTACA GTTTGCTGGT
CTTTTTAATG AGGCTGGCGG CGCTGTAAAC GGTTTTCAGG TGGCAGGTTT ACTGAACAAT
GTAAGTGGCG AAAAGAAAGG CTTCCAGGCA GCCGGACTGC TTAACCGTGT TAAAGGTGAA
ACTGAAGGCT TTCAGGTTGC CGGGCTTTGC AACTTGTCGG CCAGGAGTAT GAAGGGTGTA
CAGGCAGCAG GAATTGTAAA CGTTATTAAA GAAAATGTTG ATGGGGTACA AATTGCTGGC
ATTGCCAACC TGGTACGCAA AGACATGGAA GGCATCCAGA TAGCTGGCAT AGCTAATATG
ACCAGGCACT TAAAGGGGGT ACAAATTGCT GGTATTCTTA ACTATGCCAA AAAAATGGAT
GGTTTCCAGC TTGGCCTTAT CAATGTATCA GACACTTCAT CCGGTTACAG TTTAGGGTTG
ATAAACCTTG TAAAACATGG TTATCATAAA ATAAGCCTGT TTACCAACGA AACTGTAAAC
ACCAATCTTT CTATTAAAAC AGGCAATTCC CATCTTTATA CCATTTTATT TGCAGGCTTA
AACCTGTCAC AAAACGAAAA AGTACGAACT GTGGGTATAG GCCTTGGCCA TGATTTTATT
TTTAACAGCT GCTTGTCTGT TGGTCTTGAA ACAACTGGTC AGCTGCTCTA TCTCGGTAAG
TGGGACAGTA CCAACCTTTT GAGTAAAGTT CAGGCCAACC TGCAGGTACA GCTGGTTAAA
GGTATAAGCC TCTTTGCGGG CCCTGCCTAC GCTGTTTACA GCAGCGATAA CCCCGCCAAT
TCCAGTTCAG CAGGCTATAA GCAAAACATT GTTCCAAAGC ACCATACCAG CTTTGGCAGC
AACACAAAGG GATGGCTGGG TTTCAATGCC GGCATCACCT TCATGTAA
 
Protein sequence
MKLKLTLILS ILTGYLQAQM PLAPSKHNLE KRVSIEVKNT QISEVLTRVS RAGAFYFSYS 
GALFTTDSLV SLNVRNTPVR EILDRLFNNK VDYKENGEYI ILRYAANHLT IEPENITTAD
KLYLISGYVI DTETGRKVKQ ASVYEKRLLQ STLTDHEGFF KLRFKGDHNE VILTAAKENY
RDTTLVFLSD IKVKPEGYKD PNADEANGLF SDVENSGIGR FFISSKQRIQ SLNIPSFFAN
SPFQTSLTPG LSSHGIMSSQ VVNKFSLNVL GGYTAGTDGL EIAGLFNITK GDVKKLQFAG
LFNEAGGAVN GFQVAGLLNN VSGEKKGFQA AGLLNRVKGE TEGFQVAGLC NLSARSMKGV
QAAGIVNVIK ENVDGVQIAG IANLVRKDME GIQIAGIANM TRHLKGVQIA GILNYAKKMD
GFQLGLINVS DTSSGYSLGL INLVKHGYHK ISLFTNETVN TNLSIKTGNS HLYTILFAGL
NLSQNEKVRT VGIGLGHDFI FNSCLSVGLE TTGQLLYLGK WDSTNLLSKV QANLQVQLVK
GISLFAGPAY AVYSSDNPAN SSSAGYKQNI VPKHHTSFGS NTKGWLGFNA GITFM