Gene Phep_1921 details

Gene Information

Locus tag: Phep_1921
Symbol:
ID: 8253025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2222603
End bp: 2223733
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 37%
IMG OID: 644935572
Product: hypothetical protein
Protein accession: YP_003092191
Protein GI: 255531819
COG category:
COG ID:
TIGRFAM ID:
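
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon (the function names are illustrative, not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2222603, 2223733)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Both values agree with the Gene Length and Protein Length fields of this record.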


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.223609
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.136115
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATGA AATTAATATA TAAAACAAAC ATGATGAACA ACTTTAACTT TAAAAAACAG 
GCGCTGGTTA TACTATCGGC CTTCGTTTTG GTATTGGGCG CCTGTAAAAA GGATAAATTG
CCGCCAATTG ACCCTCAGCC AAGTGCTACA ACCGGAGTGT ATGTGTTGTG TGAAACCGGT
TATGGAAAAA TAGGAACAAT TACCTATTAT GAAGTAAATA CCGGCGCTGC TATACAGGAT
TACTATAAAA AACAGAATGG CATTGATCTG GGTGTGAACA CAAGCGACCT GAAACAATAC
GGCAGTAAAA TGTATGCCGT AGTTACCGGT ACGGATAAAG CCAGTAAGGA TTCATATATA
GATGTAATGA GTATAGCGAC AGGTAAGTCG TTAAAAAGAA TTCCTTTTTC GGATGCGACT
TCAGGCTTTT TACCACGTTA TATTGCGTTT TACAAGAACA AAGCTTATGT ATCTGGTTAC
GACGGTTATG TTACCAGGAT AGATACAGCG GGTTTAACTG TTGAATCGAG ACTTCAGGTA
GGCGGGGCGC TGGAGCAGCT GACAATTGTA AATGGTAAAC TGTATGTTAC AAACTCAGCC
CATTTTATGT ATGCAACCAG CAATAACTCA TCAGTATCTG TTGTAGACCT GAATAACTTT
AACAAGTTAA AAGACATTCC GGTAGGCTTT AATCCTACTA AAATTTCTGC AACAGGTTCG
GGTGAACTGT TTGTGGTTAC AAGAGGTAAT TATGGTAATA TCTCACCATC ATTAGATAAA
TTAAGTAGTG TTAGCGATAC TAAAACAGGA ACTGAAGCAT TAGATGTTGA GTATTTGAAT
ATAACAGGTA ATAAAGGTTT TGTAATTGGT CCGTATGGTA ATGAATTTCT AAAAAATATA
AATGTAAGTT CCGGCGTACT GGGTACTGAT TTTGTAACAG ATGCTACACC AGTTATTTTA
CCTTATGCTG TTACGGTAAA CCCGTTAAGT AATGATATAT TTGTATCTGA TGCGAATGGT
TATGCTTTAG TGGGTAAAAC ATTTTGCTTT GGTGCCGATG GTAAGAAGAA ATTTGAATTT
GCCACCGGGG GATCGCCACA AAGTGCAGTA TTTAAATACA GCTATAAATA A
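
The sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any calculation. A minimal sketch of a GC-content calculation, assuming the GC content field is the fraction of G and C bases over the full gene length (the helper name `gc_content` is illustrative):

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative input, not the gene itself: 6 of 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence to the same function returns a fraction that can be compared with the percentage reported in the record.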
 
Protein sequence
MRMKLIYKTN MMNNFNFKKQ ALVILSAFVL VLGACKKDKL PPIDPQPSAT TGVYVLCETG 
YGKIGTITYY EVNTGAAIQD YYKKQNGIDL GVNTSDLKQY GSKMYAVVTG TDKASKDSYI
DVMSIATGKS LKRIPFSDAT SGFLPRYIAF YKNKAYVSGY DGYVTRIDTA GLTVESRLQV
GGALEQLTIV NGKLYVTNSA HFMYATSNNS SVSVVDLNNF NKLKDIPVGF NPTKISATGS
GELFVVTRGN YGNISPSLDK LSSVSDTKTG TEALDVEYLN ITGNKGFVIG PYGNEFLKNI
NVSSGVLGTD FVTDATPVIL PYAVTVNPLS NDIFVSDANG YALVGKTFCF GADGKKKFEF
ATGGSPQSAV FKYSYK
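
The protein sequence can be checked against the gene sequence by translating it. A minimal sketch, assuming the codon assignments of NCBI translation table 11, which match the standard code for sense and stop codons. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative:

```python
from itertools import product

# Codon table in the NCBI ordering: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGAATGA AATTAATATA TAAAACAAAC"))  # MRMKLIYKTN
```

The output matches the first 10 residues of the protein sequence listed above.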