Gene Phep_1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1887 
Symbol 
ID8252991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2180044 
End bp2182014 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content41% 
IMG OID644935538 
Producttransglutaminase domain protein 
Protein accessionYP_003092157 
Protein GI255531785 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATATC TTCTTTTTCT GTTTGTATTT TTTTGTTCCA TTGCTGTAAG GGCACAGGAT 
TTTGGATTTG GGGAAATCAG CGGGGATGAT CTTGACTTAA AAAAGGTAAA AACAGACAGC
AATGCCAATG CAGTAGTGTT AAAGGAATTT GGTACGGCAT CAGTTCGTTT AGACGAAAGT
TATGGCAATC TTTATATAGA TTTTGAATAC CATGTCAGGA TAAAAATACT GAACAAAAAT
GGCTTCGGGA GTGCAAATGT TGTCATTCCC CAAAGAATTT ATGGCGATAA GGAAGACATG
GTCCAGAACC TGAAAGCAGT GACCATAAAT TATATAGACG GGCAATTTAC ACAAACACCG
CTGGATAAAA AAAAGGTGTT TACCGAGAAA AAGAACAAAT ATGTGGTACT GACTAAATTT
ACCATGCCCA ATCTGGTTGA GGGCAGTATT ATTGAATACA GCTACCGCCT CTATTCGCAT
GGCCTTTTTA ACTTCAGGAG CTGGGAATTC CAGTCGGACA TCCCGAAACT GTACAGCGAA
TACATTGCCA TTATCCCTGC CCTTTACACT TATAATGTAT CCTTACGTGG GGCACAGAAG
CTCAGTTCAC AAAATGCTGA ACTCTATAAA GAATGCCTCA GAATTTCAGG AAGGCCCTAT
GACTGTTCAA AAATGACCTA TATCATGAAA GATATCCCGG CATTGGTTGA AGAAGATTAC
ATGACTGCCC CAAGTAATTT CAGGTCGGCC ATTAATTTTG AACTCTCCGA ATACTACCTG
CTGTCGGGCG GGAAGAAAAG TGTAACCAAG GAATGGAAGG ATGTGGACTT TGAGCTGATA
AATGACAAAT CATTTGGCAG CCAGATGAAA AGAAAAGACC TTTTTAAAGA GCTGTTGCCA
GAGATCCTGA AAAACAAGAC TGCACCGCTG GACAAGGCAA AGGAAATCTA TGATTACATT
AAGCGCAACA TCAAGCGAAA CGGATTTATT GGAATTCAAA GTGAAAATAC AATAAAAAAA
GCTTTGGAAA CCCATTCCGG CAATACGGCG GACATTAACC TGGCGCTGGT AGCTGCATTA
AGTGCAGCGA ATCTGGATGC AGAAGCGGTT ATCCTTTCGA CCCGTTCCAA TGGCACTGTG
AATAACTTAT ACCCCGTGAT CACTGATTTT AATTATGTAA TAGCTAAGGT AAACATTGAG
GGAAAAAGCT ATTTGCTGGA TGCAACAGAG CCTTTAATGC CATTTGGTTT GCTGCCACTC
CATTGCATTA ACGGACAGGG AAGGGTAATC AACCTGAAAA AACCCTCCTA CTGGTATGAC
CTTAAAGCGA GTCAGAAAGA AACACTCCGG TACAGTTTAA TTGCTGAGCT GGGAAAAGAT
GGAAAAATAC GGGGTAACCT GACCATCCAC GCCATTGGTT ATGCGGCCTA TAATAAACGT
AAAAAGATCC TGGCAGCAAG TTCGGTGGAT GAATATGTAG AGAAGCTGGA TGAGAGTATG
CCCCAGATCA GGATCCTTAA ACATGCAATC CATAACCTGG ACAGCCTGGA AAACCTGCTT
ACCGAAAATT ATGAAGTTGA AATGTCGGCC TTTTCCAACC TCAATAGTGA CCCTTTATTT
TTTAACCCGT TTTTTATCGA CCGGATCAGC AAAAATCCTT TCAATTTAAA TGAGCGTACC
TATCCTGTAG ATCTGGGTGC AGAAAAGGAA ATCCGCATCA ACATGACAAT TAAACTGCCT
GATAACTATA ATTTGGCCGA CAAGCCTAAA GAACTGAACA TGGTACTGGC CGATGCGGGT
GGCAGGTTTA TCTGTACAAC TGCTGTTGAA GACAATATCC TGCTGTTTAA CCAGCTGATG
CAGCTTAACA AACCAATTTA TAGCTCTGCA GAATACCTTT CACTTAAGGA GTTCTACAGC
AGGATCATCC AATTGCAGAA AACGGATATT ATCCTTAAAA AATCAAAATA G
 
Protein sequence
MKYLLFLFVF FCSIAVRAQD FGFGEISGDD LDLKKVKTDS NANAVVLKEF GTASVRLDES 
YGNLYIDFEY HVRIKILNKN GFGSANVVIP QRIYGDKEDM VQNLKAVTIN YIDGQFTQTP
LDKKKVFTEK KNKYVVLTKF TMPNLVEGSI IEYSYRLYSH GLFNFRSWEF QSDIPKLYSE
YIAIIPALYT YNVSLRGAQK LSSQNAELYK ECLRISGRPY DCSKMTYIMK DIPALVEEDY
MTAPSNFRSA INFELSEYYL LSGGKKSVTK EWKDVDFELI NDKSFGSQMK RKDLFKELLP
EILKNKTAPL DKAKEIYDYI KRNIKRNGFI GIQSENTIKK ALETHSGNTA DINLALVAAL
SAANLDAEAV ILSTRSNGTV NNLYPVITDF NYVIAKVNIE GKSYLLDATE PLMPFGLLPL
HCINGQGRVI NLKKPSYWYD LKASQKETLR YSLIAELGKD GKIRGNLTIH AIGYAAYNKR
KKILAASSVD EYVEKLDESM PQIRILKHAI HNLDSLENLL TENYEVEMSA FSNLNSDPLF
FNPFFIDRIS KNPFNLNERT YPVDLGAEKE IRINMTIKLP DNYNLADKPK ELNMVLADAG
GRFICTTAVE DNILLFNQLM QLNKPIYSSA EYLSLKEFYS RIIQLQKTDI ILKKSK