Gene Phep_1592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1592 
Symbol 
ID8252694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1882989 
End bp1884992 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content39% 
IMG OID644935246 
Producttransglutaminase domain protein 
Protein accessionYP_003091867 
Protein GI255531495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.184226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGA AAAATCTACT CGTATTATGC CTATTGCTAC TGGTATCGTT CTCGTATGCC 
CAGGAAAATG TAGTCAGCAA ATCAAAAACC TTCAAATATG GAAAAATAGA ACCCGCCGAA
TTTGACAGTA AAGGAAGTGG TGCGGATTCG GCTGCAGCAG CCCTGATCTT GTTTGATGTA
GGGAGAGGAT ATTTTGAGCT CAGCCCTAAG ACCGGTGACT TTGCATTCGT TTTTGAAAGA
CATATCCGTT ACAAGATCAT CAATAAGAGC GGGTACGATT ACGGAAACCT GGAACTGAGG
TTTTACAAAC AAAATTCTTC GGAAGAAAAG CTGGACTACA TGGATGCGGC CACATATAAC
CTTGAAGGTG ATAAAATTGT GATCAGCAAG ATAAATAAAG ATGCCAAATT CTCAGAAAAA
CAGGATAAAA ATTTTACGCT TAAAAAGTTT GCCCTGCCCA ATGTAAAAGA AGGATCTATT
GTTGAATATA AATACAAAAC AAAATCCGAT TTTATCTTTA ACCTGCAGCC CTGGTATTTT
CAGAGAAGCA TCCCTACCCT ATATTCTGAG TACGAAGTCA ATATTCCGGA ATATTACAAA
TACAAGATAA GATCTGGCGG GTACCTGTTT TTAAACCCCA AACAGGAATA TGTGAACGAA
ACTTTTAGTT CGAGCGCAGG AACACTGAAT GCCTCATGCT TAAAACTGCA TTATCAGGTT
GAAAATGTGC CGGGTTTAAA GAAAGAGAAT TTCATTACTA CGATGGAAGA TTATGTGAGC
AAAGTAGGTT TTGAACTGAG CTCTGTAACA GTACCCGGAC AGGTTTACAG GGAATTTACT
TCTTCCTGGC CAAAAATTGT TACCGGACTT AAAACAGAAG AAAACTTCGG GGCTTTTATA
AATAAAAAAA GTTACAGCAA AAGCATTTTA AAGGACATTG TAAAAACTGC AACACAGCCT
GATACGGTAT TACAGCTCAT TTTTAACTAT GTAAAAAATA ACATTAAATG GGACGGTAAT
TACCGCTTGT ATACTTCGGA AACCAGCCCA AAAAACATAT TTGAGAAGAA AACCGGTAAC
TCGGCCGATA TTAACCTTTG TCTGCTCACA CTTTTAAATG AAGCAAACAT TACAGCATCA
CCGGTTTTAC TGAGTACAAG GGAAAATGGG GCACATCCTG GTTTTCCAAT GATTACGGAT
TTTAACAATG TGATCGTTCA GGCCGAAATT GGCGATAAAA TGATCCTTTT GGACGCCACA
GATAAAGACC ACGCCCTGAA CATGATTGCC TATGAAAACC TAAACCATCA GGGCTTAAAA
GTAAACCTGC CTGATGCCAC TGCTGCATGG ATATCACTGG ACGAGGCCAA TCTGAGCAAG
ACAAACATCA ATTTAATGCT GACTCTGGAC AAAGAAAACA AATTCAGTGG TAAACTGTAT
CTGTCGTCTA CCCATTATGA GGCTTTAAAC CGCAGGGGAA AATACCGTTC GGCGACAAAT
GAGACTGATT TTCTGAAAGA TTATAAAACC GACAGACCGG GTCTTGGTAT AAAAAACTAT
CAGATCCAGA ACCTGGCCAA CCTGGCCGAA CCTTTGGTGG AAAGCATGGA TGTAACCATT
GAGGACAATG TGGAAGAAGC CGGAAACCTG GCCTATTTTG CCCCGCTATT GTTTGAAAGG
ACAAAGGAAA ATCCCTTTAA ACTGGAAGAA AGGATTTATC CGGTCGACTT TGCTTATCCT
AAGGAAGAAA ATTACCGCAT CACAATCGAT TTTCCAAAAG AATACCATTT GGATAAATCG
CCCAAAAATG AAAAAGTAGT TTTACCGGAT GATGCTGCTT CCTTTGTTTT CATGTTTGCT
GCTGAAGAAA ATAAGCTGAT GATCACCAGC AAAATATCCT TGAAAAAAGC ATTCTTCACG
CCGGAAGAAT ACCACTACTT AAAAGAGCTT TTTAAGAATA TTGTAAGAAA ACAGGCAGAA
CAAATTGTAT TTAAGAAAAG TTAA
 
Protein sequence
MRKKNLLVLC LLLLVSFSYA QENVVSKSKT FKYGKIEPAE FDSKGSGADS AAAALILFDV 
GRGYFELSPK TGDFAFVFER HIRYKIINKS GYDYGNLELR FYKQNSSEEK LDYMDAATYN
LEGDKIVISK INKDAKFSEK QDKNFTLKKF ALPNVKEGSI VEYKYKTKSD FIFNLQPWYF
QRSIPTLYSE YEVNIPEYYK YKIRSGGYLF LNPKQEYVNE TFSSSAGTLN ASCLKLHYQV
ENVPGLKKEN FITTMEDYVS KVGFELSSVT VPGQVYREFT SSWPKIVTGL KTEENFGAFI
NKKSYSKSIL KDIVKTATQP DTVLQLIFNY VKNNIKWDGN YRLYTSETSP KNIFEKKTGN
SADINLCLLT LLNEANITAS PVLLSTRENG AHPGFPMITD FNNVIVQAEI GDKMILLDAT
DKDHALNMIA YENLNHQGLK VNLPDATAAW ISLDEANLSK TNINLMLTLD KENKFSGKLY
LSSTHYEALN RRGKYRSATN ETDFLKDYKT DRPGLGIKNY QIQNLANLAE PLVESMDVTI
EDNVEEAGNL AYFAPLLFER TKENPFKLEE RIYPVDFAYP KEENYRITID FPKEYHLDKS
PKNEKVVLPD DAASFVFMFA AEENKLMITS KISLKKAFFT PEEYHYLKEL FKNIVRKQAE
QIVFKKS