Gene Phep_3129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3129 
Symbol 
ID8254247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3736930 
End bp3738150 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content45% 
IMG OID644936782 
Productprotein of unknown function DUF214 
Protein accessionYP_003093387 
Protein GI255533015 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCA TCAACCTTAT CAGATTAGCC ATTAAAGCAC TGCAACGCAA TAAACTACGT 
GCATTACTCA CCATGCTGGG AATTATTATA GGGGTAGCTT CGGTAATTAC CATGATGTCT
ATCGGAGAGG GTTCCAAACA GAGCATTAAT GCTTCCCTGG CAAGTATGGG CTCTAATATG
ATTACCATTA TGCCTTACAG CAATGTACCA GGTGGCGCAC GTATGATGGG CAACAGTTTT
AAAACACTGA CCCTTAAAGA TGTAGAGGCA TTGAAAAAGA ATGCTGTTAA TATTGCAGAA
ATTTCTCCGC TGGTCTCTTC AAGCGGACAA TCGATAAGTG GCCCCAACAA CTGGCCAACA
AGCATCCAGG GTGTAAGCCC CGCATACCTT GACATCAGAA AACTTGTAGT GAAAGACGGT
ATAATATTTT CCGACCAGGA CATCAGGTCT TCGGCTAAAG TATGCCTGCT TGGCAAAACA
GTGATCGACA ATCTGTTTCC CAATGGGGAT GACCCGATCG GAAAGATCAT CAGATTTGGC
AAAATCCCTT TCCAGGTCAT CGGCACCCTA GTACCAAAGG GCACCAGTAA TTTCGGTCAG
GATCAGGACG ACATCATCAT AGCCCCTTAT ACCACTGTAC AGAAGAGGAT TACTTCCTCT
ATTTACTTCA ATCAGATTTA CGCATCAGCC ACCAGCGAAG CTGCCTCTGA TGCCGCTGTA
GCAGAAATAA CCAGCATTTT AAAAGACACA CATAGAATCA GGCCCGGTGA AGAAAATGAT
TTTCAGGTAA GGACACAGGC TGAACTGATG ACCATGATGA ACTCGACCAG CAGCATGATG
ACTGCCCTGT TAACCGCAGT TGCCAGTATT TCGCTGGTAA TTGGCGGAAT CGGTATCATG
AACATCATGT ATGTTTCTGT AACCGAAAGG ACACGCGAAA TTGGGTTGAG GATGTCGATC
GGGGCCAGGG GCATTGATAT TTTACTGCAA TTTTTAATTG AAGCCATAGT AATCAGTGTT
ACAGGTGGCC TGATCGGGGT GCTGCTTGGC ATTTCTGCTG CCATAGCTGT TCCCGCCTGG
TTGAACTGGC CAACTGTAAT TTCTGAATTT TCTATTGTGA TCTCCTTCCT GGTCTGTGCT
TTAACAGGTA TATTTTTTGG TTATTACCCC GCACTTAAAG CATCCAAACT GGATCCGATT
GAAGCGCTCA GGTATGAATA G
 
Protein sequence
MKFINLIRLA IKALQRNKLR ALLTMLGIII GVASVITMMS IGEGSKQSIN ASLASMGSNM 
ITIMPYSNVP GGARMMGNSF KTLTLKDVEA LKKNAVNIAE ISPLVSSSGQ SISGPNNWPT
SIQGVSPAYL DIRKLVVKDG IIFSDQDIRS SAKVCLLGKT VIDNLFPNGD DPIGKIIRFG
KIPFQVIGTL VPKGTSNFGQ DQDDIIIAPY TTVQKRITSS IYFNQIYASA TSEAASDAAV
AEITSILKDT HRIRPGEEND FQVRTQAELM TMMNSTSSMM TALLTAVASI SLVIGGIGIM
NIMYVSVTER TREIGLRMSI GARGIDILLQ FLIEAIVISV TGGLIGVLLG ISAAIAVPAW
LNWPTVISEF SIVISFLVCA LTGIFFGYYP ALKASKLDPI EALRYE