Gene Phep_1944 details


Gene Information

Locus tag: Phep_1944
Symbol:
ID: 8253048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2244198
End bp: 2245535
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 40%
IMG OID: 644935595
Product: OmpA/MotB domain protein
Protein accession: YP_003092214
Protein GI: 255531842
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
TIGRFAM ID:
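
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2244198, 2245535)
print(length)                     # 1338
print(protein_length_aa(length))  # 445
```

Both results match the Gene Length and Protein Length fields of this record.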


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.000859012
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACTCAA GAATTACAAA AACAGCTTTA GCGCTGTCCC TAATAGGGTT ATCAACGCAA 
TTATTTGCAC AAGATTCAGG TACCCAAGGA GGTAGATTTT CAGAGAAAAC TTTTCGCACA
TGGTCGGTTG GTATTCATGG CGGTGTATTG AGCCAAAATA CAATATTCAA TGGTAAAGAA
CGCGACTTTC AAACTGCCAA AGAAAATATC GGTTATGGCG CCTTTATTAA AAAACAAATA
TTGCCTTCAT TAGGTATCCA GGCTGATTTC CTGGGAGGAA AAGTTGAAGG ATTAAGGTCT
TATGCCGGTG TAGATGCAAA TGGCGACAAT ACCTATCTGA ATGGCTCCAG CTATCAAACT
AAAATAGACT GGTCTGCTGC CTTAACCGCT GTTTATAACA TTGCCAACAT CAATATCAAC
AATGAAAATG CAGTATTGAT TCCTTATGTT AAAGCTGGTG CTGGTTACAT GAACGCCGGT
GCAACAACCA CTAACGTTCC GTTGTCTGCC AATGAAGGTT ATAAAGAAAG ATGGTTTGTT
CCAGTTGGTG CTGGTTTTAA ACTGGGTGTT GCCAAAGGCA TCAATGTTGA CTTAGGATAC
GATGTAAACT TTGTTAAATC TGACAAATTT GATGGTTTCA ATTCAGGTGG TAAAAATGAC
AAATTCTCTT ATGGACATAT AGGTTTAGAA TTTGCACTGG GAAGCAAAGA AAAGCCACAA
TTGCAAAACT ACAGCTCATT GGCCAACCTG CGTAAACAGT CAAAAGAAGA ATCTGATGAA
TTGAGAAGAG CATTGTCTAC TGCTGAACAG AATGCAGCCA GAGATAGAGA ACAATATGCT
AAAGACATGG GAGATGATGA CAATGACGGT GTAGCCAATA AATTCGATAA ATGTCCTGGA
ACAGCATCTG GTACAGTTGT AGACGGATCT GGCTGCCCTA TCAAGGTACA GCGGGAAGTC
ATCAAAGAAA CTAAAGTTGT AGTAACAGAG GCCGACCGTA AAGTGGTTGA TGAAGCCATC
AAAAACTTAG AATTTGACTT GGGTAAAGCT ACAATAAGGG CTAAATCTTA TGCCACTTTA
AATAAAGTAG CTGCTTTGCT AATCGAGAAA AACTTTAGCC TGAAATTAGC CGGACATACA
GATAACACCG GATCTATGGC GTTAAACTTA CGTTTGTCTA AAGAAAGAGC TGAAGCTATC
AAAGCATATT TGGTATCACA GGGTGCAAAT GCTTCACGTA TCGAAGCAAC AGGTTATGGT
CCGAACCAAC CTATCGCATC TAATAAAACT GCCGAAGGTC GTCAGAAAAA CCGTAGGGTA
GAATTTACAT TGTACTAG
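
The GC content reported for this gene can be recomputed from the sequence as printed. The sketch below is one way to do that, assuming GC content means the fraction of G and C among the A, C, G and T bases, with the spaces and line breaks of the 10-base blocks ignored. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters in seq.

    Whitespace and any other characters are ignored, so the sequence
    can be pasted exactly as it is laid out in blocks of ten bases.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence above; paste all lines for the full gene.
first_line = "ATGAACTCAA GAATTACAAA AACAGCTTTA GCGCTGTCCC TAATAGGGTT ATCAACGCAA"
print(f"{gc_content(first_line):.0%}")
```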
 
Protein sequence
MNSRITKTAL ALSLIGLSTQ LFAQDSGTQG GRFSEKTFRT WSVGIHGGVL SQNTIFNGKE 
RDFQTAKENI GYGAFIKKQI LPSLGIQADF LGGKVEGLRS YAGVDANGDN TYLNGSSYQT
KIDWSAALTA VYNIANININ NENAVLIPYV KAGAGYMNAG ATTTNVPLSA NEGYKERWFV
PVGAGFKLGV AKGINVDLGY DVNFVKSDKF DGFNSGGKND KFSYGHIGLE FALGSKEKPQ
LQNYSSLANL RKQSKEESDE LRRALSTAEQ NAARDREQYA KDMGDDDNDG VANKFDKCPG
TASGTVVDGS GCPIKVQREV IKETKVVVTE ADRKVVDEAI KNLEFDLGKA TIRAKSYATL
NKVAALLIEK NFSLKLAGHT DNTGSMALNL RLSKERAEAI KAYLVSQGAN ASRIEATGYG
PNQPIASNKT AEGRQKNRRV EFTLY
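
The protein sequence can be derived from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAACTCAA GAATTACAAA AACAGCTTTA"))  # MNSRITKTAL
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence to `translate` allows the same comparison over the whole record.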