Gene Phep_4098 details

Gene Information

Locus tag: Phep_4098
Symbol:
ID: 8255232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4940475
End bp: 4941584
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 43%
IMG OID: 644937762
Product: coagulation factor 5/8 type domain protein
Protein accession: YP_003094351
Protein GI: 255533979
COG category:
COG ID:
TIGRFAM ID:
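
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state the convention) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4940475, 4941584)
print(length)                     # 1110
print(protein_length_aa(length))  # 369
```

Both values agree with the Gene Length and Protein Length fields in the record.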


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACTA GAATATATTT ATCAATAGCA TTCGCAGGTG TTAGCATGTT TATTGCCTGT 
AAAAGAATGG ATAGTACTTA CGAGAAGTAT GTAGTTCCAA ATGGGATTAT CTACCCTGGA
AAAGCAACTT CACCCTCGGT TAAGCCAGGT CTTAACCGGA TACAGATCTC TTGGGCCCGT
GGTACAGATC CAAAAGTCGT AAAGGCTAGG ATTTTTTGGA ACAACTACAG GGATTCTGTG
GAAGTCCCTG TTCCGGCAGA TCAGGATATC ATCAGTTATA CCATTCCAAA TCTGGCAGAA
AATTACTATA CCTTTATCAT TAAAACTTAT GATAAGGATG GAAGGGTATC TGTTCCAGTA
GAAGTGTCTG GGGATGTATG GGGCGAACGC TATCGCAGCA CCTTGCTTAA CCGCCTCGCA
GTGTCATCCA CAATTAACCT TGCCAATACT TTAACCATTC AATTTGAACC GGTGCTGGCG
GGCAGTACAA TTGTGAAATC AGAGGTAGAA TACACTACAA CTGCAGGAGC TCTGAAAACA
GTAACGGTTT TACCAGGTAC GCTGTCGCTG CAATTAACAG ATTATAAAAA GGAAACAGGT
TATAAGCTGA GGACGCAGCA TTTAAATCCT ACTGCAATTG ATCCTTTTTA TACCAACGAT
CAGCTGGTAG ATGGATTTTT GTTAAACAAA ACCGAGTGGA AGATTGCAGC ATATTCTTCT
TACAATACTA CCGATGCAAA TAATGTAAGT GCTCCGGCCA ATATGATAGA TGGTAATCCA
GCCACCCGCT GGCTTTCATT GGTAAGTGCC AGTTACCCGC ATTTTGTTAC TGTCGATTTT
GGTACACAAA GAACACTAAG AACGGTTAGC TTATGGAGAT GGGTACAGAC AGCTCCTGAT
GAGCGCGGGC CTAACGTTGT GCAGTTTTTT GGAAGTCTGG ACAATACTAC CTGGACCGAC
CTGGGTACTT ACAATTTTGA CCGACTTACC AATAATGAGC AGATTTATAC CATTCCAAAT
CTGCCTCAGG CCAGGTACCT TATGGTTAAA GCCATAAGCG GGCCCCAGGT GTATGTAATT
CTGGGAGAAG TAAATGTTAC CGTAAAATAG
 
Protein sequence
MKTRIYLSIA FAGVSMFIAC KRMDSTYEKY VVPNGIIYPG KATSPSVKPG LNRIQISWAR 
GTDPKVVKAR IFWNNYRDSV EVPVPADQDI ISYTIPNLAE NYYTFIIKTY DKDGRVSVPV
EVSGDVWGER YRSTLLNRLA VSSTINLANT LTIQFEPVLA GSTIVKSEVE YTTTAGALKT
VTVLPGTLSL QLTDYKKETG YKLRTQHLNP TAIDPFYTND QLVDGFLLNK TEWKIAAYSS
YNTTDANNVS APANMIDGNP ATRWLSLVSA SYPHFVTVDF GTQRTLRTVS LWRWVQTAPD
ERGPNVVQFF GSLDNTTWTD LGTYNFDRLT NNEQIYTIPN LPQARYLMVK AISGPQVYVI
LGEVNVTVK
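
The relationship between the two sequence blocks above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline. It assumes the sequence is pasted with the 10-base spacing shown above, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = clean(
    "ATGAAGACTA GAATATATTT ATCAATAGCA TTCGCAGGTG TTAGCATGTT TATTGCCTGT"
)
print(translate(first_line))  # MKTRIYLSIAFAGVSMFIAC
```

Passing the full gene sequence through `clean` and then `translate` or `gc_percent` gives values that can be compared with the protein sequence and the GC content field of the record.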