Gene Phep_2943 details


Gene Information

Locus tag: Phep_2943
Symbol:
ID: 8254054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3507892
End bp: 3509028
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 45%
IMG OID: 644936591
Product: hypothetical protein
Protein accession: YP_003093203
Protein GI: 255532831
COG category:
COG ID:
TIGRFAM ID:
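
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming a complete CDS ending in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(3507892, 3509028)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both results agree with the Gene Length and Protein Length fields of the record.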


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.366822
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAA TAATTGGTTT TTTACTGTTG TTCCCTGCAC TGTCTTCAGC TCTGATGGCA 
CAGCAAACCG GTGCCAAAGC AATCCTAACA GTAAGTAATC CTGCAAAAAT AGAGCGGATA
AATGCAGTTG TAAGCCTTAG CTGGGCTCAG ATAACGGCTA AATATCCGCG TATTGATACG
GCGAATTTTA AGGTGCTCAA TGCGCTTACC AAAAAAGAAG TTCCTTTTCA GCTGGAACAC
CGCGGACAGG CATCGATACA AAACCTGTTG GTGCAGCTTA GCCTGAAAGC AAATGGCACT
GCCAAACTTT TGATCGTTGC TGGAAAACCA GCGCCGGTTG CAAAAAAAGC TTATGGCCGC
TATGTGCCTG AACGTTTTGA CGATTTTGCC TGGGAAAACG ATAAAGTAGC TTTCCGCATG
TATGGCAAAG CCCTGGAAGG AAGAAAAGAC AATGCTTTTG GAACCGATGT ATGGGTAAAA
AGGACAAGTA AACTGGTGAT CAACGACTGG TACAAGACAG GTGATTACCA TACGGACCAC
GGTGATGGAA TGGATTATTA CAGTGTAGGC CTGACCCTGG GTGCTGGTGA TATTGCTCCT
TATGTAAAAG ATTCTGTTTA TTTTCCTTTA AATTACCACA ACTGGAAGGT GCTCGACAAT
GGACCGCTGC GCACCACGTT CCAGCTCGGT TATGATGCCT GGGATGTAGC CGGAAAATCT
GTGAAAGTGC TTAAAACCAT CAGTCTGGAT GCCGGTTCGC ACCTGAACAG GGTGGAGGCA
GTGTATACTT ATAACGGTGA TATGCTGCCG GTAGTCGTAG GTATCGTGAA AAGAAAAGAA
CCCGGAACCA TTTTAATGGA TGAGCAGCAA GGTATTTTAG GATACTGGGA ACCTCAGCAC
GGTGTTGATG GTACCACTGG CGTAGCCACC ATTGTTACAG GTCAGCCTGT AACAATGGAT
ACCGATAAGA CCCATTTACT TAGCCATGCT ACTGCCAAAA ATGGCGAGCC TGTAGTATAT
TATAATGGCT CTGCATGGAG CAAGGGAAAT GAAATAACCA CGGCCCAGGC CTGGTTTAAT
TATCTGAATA ATTTTAAACA GCAATTGCAG CAGCCTTTAA AGGTAAGTGT GCAATAA
 
Protein sequence
MKRIIGFLLL FPALSSALMA QQTGAKAILT VSNPAKIERI NAVVSLSWAQ ITAKYPRIDT 
ANFKVLNALT KKEVPFQLEH RGQASIQNLL VQLSLKANGT AKLLIVAGKP APVAKKAYGR
YVPERFDDFA WENDKVAFRM YGKALEGRKD NAFGTDVWVK RTSKLVINDW YKTGDYHTDH
GDGMDYYSVG LTLGAGDIAP YVKDSVYFPL NYHNWKVLDN GPLRTTFQLG YDAWDVAGKS
VKVLKTISLD AGSHLNRVEA VYTYNGDMLP VVVGIVKRKE PGTILMDEQQ GILGYWEPQH
GVDGTTGVAT IVTGQPVTMD TDKTHLLSHA TAKNGEPVVY YNGSAWSKGN EITTAQAWFN
YLNNFKQQLQ QPLKVSVQ
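
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch, the code below strips that spacing, translates DNA codon by codon, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The amino-acid assignments of translation table 11 are the same as in the standard code (the tables differ only in the permitted start codons), so a single codon table is used here.

```python
# Compact codon table in NCBI order: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
first_bases = "ATGAAAAGAA TAATTGGTTT TTTACTGTTG"
print(translate(first_bases))  # MKRIIGFLLL
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to the same functions would be expected to reproduce the 378-residue protein and a GC content near the 45% reported in the record.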