Gene Phep_1225 details

Gene Information

Locus tag: Phep_1225
Symbol:
ID: 8252323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1443469
End bp: 1444689
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 42%
IMG OID: 644934879
Product: protein of unknown function DUF214
Protein accession: YP_003091504
Protein GI: 255531132
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID:
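
The coordinates and lengths listed above are consistent with one another, which a short calculation can show. The Python sketch below is illustrative only: it assumes 1-based inclusive coordinates (the record does not state its convention) and assumes the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count, assuming the last codon is a stop codon (an assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1443469, 1444689)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Under these assumptions, 1221 bp corresponds to 407 codons, that is, 406 residues plus one stop codon.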


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.594167
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAATACAG AATATTTTAT TGCAGGACGC ATAGCCATTA AATCGGAGCG TACCTTTTCT 
AAACTGATTG TTCGCATTGC CATAGCGGGG GTAATGCTCA GTCTGGCTGT GATGATGCTT
TCTGTAGCCA TTATAAAAGG TTTTAAAACC GAGATACAGG AAAAAGTAAG AGGATATATA
GGCGATGTAA GGGTTTTTAA ATATGATCTG AACAATTCTT TTGAACTCTC CCCTTTTGTA
CCTGCCAAAG AGACTTTGGC CCAGCTTAAA AATAACCCTG ATGTTGAGTT CTTTCAGCCC
TACGCTACTA AACCAGCTAT AATTTCAGCC AACAATGAAG TTGAAGGGAT CAATTTCAAA
GGGATCGACA AGACCTTTAA CTGGGATTAT ATCCGCAGAC ACCTGGTTAA TGGTAAGGTC
ATCGATTTTA CCGATAGTGT GAAGGCCAGT AAAGAGATCC TCATCTCGCA GTTTACCGCC
AACCGCTTAA AGCTAAAGGT AGGCGATGAT TTTATCATGT ATTTTGTACA GAACCCACCG
CGTAAGCGAC CTTTTAAGAT CGTGGGGATT TATGATATCG GCGTAGAAGA AATCGATAAG
AATTTTGTAA TCGGTGATTT AAATATCATC CGCAGGTTAA ACAACTGGAA AGCCAATGAA
ATAGGCGGAC TGGAAATCAG GATTAAAGAT TTCTCCCGAT TAAAGGAAGT CTCAACACAC
ATTTACGAAA ATATGGAGCT GAAGCTGAAA TCGGAGTCGG TTTCTGATTA TTTTCCTGCA
ATTTTTACCT GGCTGTCCTT ACTGGATGTG AACACCAAAG TGTTACTGGT TTTAATGATG
GTGGTTGGTG TCATCAATAT GGTTACCGCC TTGTTGATCA TGATCCTGGA ACGCACCAAT
ATGATCGGTA TCATGAAGGC ATTTGGCATG ACGGATTACA GTGTGATGAA AATATTTTTG
TACAATGCCG CTTATCTGGT AGGGCTGGGC TTATTGCTGG GCAATATACT GGGGCTGGGG
CTGGGCTTCC TGCAAAAATA TACACATATT TACAAACTGG ACCAGTCTTC TTATTACCTG
TCGTATGTGC CCATCGAGCT TCATTTGGCA GATGTACTGC TCCTGAACCT GGCTACTATG
GTGATCTGTG TGCTTGTACT GATCCTGCCC TCTATGCTGG TCAGCCGGAT CAGCCCTTTA
AAAGCCATCA GGTTTAAGTA A
 
Protein sequence
MNTEYFIAGR IAIKSERTFS KLIVRIAIAG VMLSLAVMML SVAIIKGFKT EIQEKVRGYI 
GDVRVFKYDL NNSFELSPFV PAKETLAQLK NNPDVEFFQP YATKPAIISA NNEVEGINFK
GIDKTFNWDY IRRHLVNGKV IDFTDSVKAS KEILISQFTA NRLKLKVGDD FIMYFVQNPP
RKRPFKIVGI YDIGVEEIDK NFVIGDLNII RRLNNWKANE IGGLEIRIKD FSRLKEVSTH
IYENMELKLK SESVSDYFPA IFTWLSLLDV NTKVLLVLMM VVGVINMVTA LLIMILERTN
MIGIMKAFGM TDYSVMKIFL YNAAYLVGLG LLLGNILGLG LGFLQKYTHI YKLDQSSYYL
SYVPIELHLA DVLLLNLATM VICVLVLILP SMLVSRISPL KAIRFK
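
The relationship between the gene sequence and the protein sequence can be sketched in Python as follows. This is a minimal illustration, not the method used to produce this record. It builds the codon table from the standard amino-acid string that NCBI lists for translation table 11, strips the spaces and line breaks used in the 10-character blocks, and emits M for the first codon on the assumption that it is the initiator (here the gene begins with TTG, an alternative start codon). It does not check that the first codon is a valid start. The names `translate_cds`, `gc_percent` and `clean` are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G at each position, matching the NCBI table layout.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding sequence up to the first stop codon.

    Assumption: the first codon is the initiator and is emitted as M.
    """
    seq = clean(seq)
    protein = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAATACAG AATATTTTAT TGCAGGACGC"))  # MNTEYFIAGR
```

The printed residues match the first ten residues of the protein sequence shown above. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.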