Gene Phep_0868 details


Gene Information

Locus tag: Phep_0868
Symbol:
ID: 8251962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1024028
End bp: 1024996
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 39%
IMG OID: 644934523
Product: von Willebrand factor type A
Protein accession: YP_003091152
Protein GI: 255530780
COG category: [R] General function prediction only
COG ID: [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain)
TIGRFAM ID:
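
The start and end coordinates, gene length, and protein length listed above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1024028
end_bp = 1024996

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 969 322
```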


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.721273
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00379424
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAGCG TTCATAATTA TTATGAACGC TTTTTTTATA TTTACAGCAT GCAGTCGCCA 
ATTGATGAAA CAACCTTACA TTTTAATGGC AACCTTGAAT TACTGGCCAG GCAGGTAGTG
GAAGGTTTTA TTACCGGATT GCACAAAAGC CCTTTTCACG GTTTTTCTGT TGAATTTGCA
GAACACCGGC TTTATAATGT TGGCGATAAT GTTAAAAACA TCGACTGGAA ACTTTATGCC
AGGACCGACA AGCTTTTTAG CAAACGCTAT GAAGAAGAAA CCAATCTGCG CTGTCAGTTC
ATCATCGATG TTTCTTCGTC CATGTATTTC CCTTTAAAAG CATACAATAA GCTTAATTTT
TCTGTACAGG CGGTTGCTGC ATTGGTTTAT TTGCTAAAAA GGCAGCGCGA TGCCTTTGGC
CTAAGTCTTT TTACCGATCA GCTGGTATTG AATACCCCGG CTAAATCTAC CACCACCCAT
CAAAAATACC TGTTCGCAAG ACTGGAAGAA ATTTTAAAAG CCGAGCAGAT GAACGTAAAG
ACCAATCTTG ACCAGGCTTT GCACCAAATT GCAGAGCTGA TCCATAAACG TTCCCTGGTG
GTTGTGTTTA GTGATCTGCT CAGTACTGCT CAGGATGAAC ATCAGATCGA AGGTTTATTC
TCGGCCCTTC AGCACCTGAA GTTCAATAAA CATGAAGTCA TCATTTTTAA TGTGACAGAC
AAAGCAAAAG AAGTAGATTT TAAATTTGAG AACCGTCCTT ATCAATTTGT TGATATGGAA
ACGGGTGCTA TACTAAAAGC ACATACCTCA AAAGTTAAAG ATGCCTATCT GTTAAAGATG
CAGGCCTACA GGCAGGCCAT TCAGCTTAAA TGTGCACAGT ACAAAATTGA TATGGTTGAT
GCAGATATTG CCAAAGGATT TTACCCCATA TTACAGGCTT ATCTAATCAA GCGTCAAAAA
ATGAGTTAA
 
Protein sequence
MKSVHNYYER FFYIYSMQSP IDETTLHFNG NLELLARQVV EGFITGLHKS PFHGFSVEFA 
EHRLYNVGDN VKNIDWKLYA RTDKLFSKRY EEETNLRCQF IIDVSSSMYF PLKAYNKLNF
SVQAVAALVY LLKRQRDAFG LSLFTDQLVL NTPAKSTTTH QKYLFARLEE ILKAEQMNVK
TNLDQALHQI AELIHKRSLV VVFSDLLSTA QDEHQIEGLF SALQHLKFNK HEVIIFNVTD
KAKEVDFKFE NRPYQFVDME TGAILKAHTS KVKDAYLLKM QAYRQAIQLK CAQYKIDMVD
ADIAKGFYPI LQAYLIKRQK MS
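
As a sanity check, the gene sequence can be translated and compared with the protein sequence. The sketch below assumes the spaces and line breaks in the listing are stripped first. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so the table is built by hand here. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first ten codons and the last three codons of the gene are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace from a sequence listing and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_codons = "ATGAAAAGCG TTCATAATTA TTATGAACGC"
print(translate(first_codons))  # MKSVHNYYER
print(translate("ATGAGTTAA"))   # MS
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison with the protein sequence and the GC content reported in the record.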