Gene Plim_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_0521 
Symbol 
ID9137198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp649497 
End bp650645 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content54% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003628568 
Protein GI296120790 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTTC GATACTCACG CTATGACGAA AACAACGCGT TCTCCCCGCA ATCGGCCGAC 
GAACTCTTCG ACCAACTCTC CGAGTATATG TTGCAATACG GCGACGAAGT CCTCGACAAC
CTGAGTGACT GGGAAGAGCA ACAGCCCGAT GTCGTCGACA TGCTCATCCG CCAGGGTCTC
GTCGAGAAAG ACCGCGAAGG ACGGTACTCC GTCACCCCCA AAGGTCTTAA ACGCGTCGAG
AATCGGGCCC TCGATGAACT CTTCCAGGTC CAACGCAAAG ATTCCTTCGG CAAACATCAG
GTCGATTTTC GCGGCCCGGG CGAAGTCTTG CAGGATGAAT CCAAGAAATA CGAATTCGGA
GACGCCATCT CCAATCTCAA CCTGCACGAA ACCATGCGGT CTGCCATGTC TCGTCACGCC
CGGGAAGGCA AACTTGCTAA CCGGCAGATC CACATCCAGG AAGACGACCT CGTTCTTTAC
GATCAGCAAT ATCAGACCAA CTGTGCCACA GTCCTCCTGG TCGATATGTC CGGCAGCATG
ACTCGCATGG GTAAATACGG GTCGGCCAAA CGCGTCGCCA TGGCTTTACA GGCCCTGATC
AACGGGCGCT ACCAAGGAGA TTTTCTCCAG ATCGTCGGAT TCTATACTTA CGCCAGCCCA
TTAAGCTCCA AAGAACTCTT TGCCTCGGCC CCCAAACCTG TCAGCATGTA CGACCCACGC
ATCCGGCTGC GCATTTCTCT GGATAATTCA CCGGCATTCG TGCCACAACA CTTCACGAAT
ATCCATGCAG GGCTGCAATT CGCCCGCAGA ATTCTCAATA AGCAGCCCAC TCAGAATCGA
CAGATCCTCA TCGTCACCGA TGGCGAGCCC ACAGCCCATG TCGAGGGCAG AGATCTCATG
CTGATCTACC CTCCCAGCGA ACAGACCGCG CTGGCTACTT TAGCAGAAGC CAAACGCTGT
GCGGCAGAAG GGATCAGCAT CTCCAGCTTC GCACTCATTG AAGATTACTT CTACCTCGAA
CTGGTCAACT TCGTTCAACG CATGGCCGAA GTCACCGGCG GCATTTCAGC CTACTGCAAC
GCCGGCGACC TGGGAAACCT CGTCATTGAG AGCTTCATTA AAGGCCGCAA AAAACGCATG
GCGAGGTAA
 
Protein sequence
MDFRYSRYDE NNAFSPQSAD ELFDQLSEYM LQYGDEVLDN LSDWEEQQPD VVDMLIRQGL 
VEKDREGRYS VTPKGLKRVE NRALDELFQV QRKDSFGKHQ VDFRGPGEVL QDESKKYEFG
DAISNLNLHE TMRSAMSRHA REGKLANRQI HIQEDDLVLY DQQYQTNCAT VLLVDMSGSM
TRMGKYGSAK RVAMALQALI NGRYQGDFLQ IVGFYTYASP LSSKELFASA PKPVSMYDPR
IRLRISLDNS PAFVPQHFTN IHAGLQFARR ILNKQPTQNR QILIVTDGEP TAHVEGRDLM
LIYPPSEQTA LATLAEAKRC AAEGISISSF ALIEDYFYLE LVNFVQRMAE VTGGISAYCN
AGDLGNLVIE SFIKGRKKRM AR