Gene Plim_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_0223 
Symbol 
ID9136878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp287713 
End bp288903 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content53% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003628274 
Protein GI296120496 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.508824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAAAA ATTCCTCAGG TCATCGACGA GTCATTCACA AGCCACATCC CAGGCGGGGT 
GCGATTGCGA TCCTGGCTGC CTTTGTGATG GTCGCGCTCC TGGCTTTGGC GGGGTTTTTT
CTTTCACTCT CCTATGTCGA ACTGACGCGT GCTGAACTCC GGGCCGCAAC CGACGCCGCT
GCTCGTTCGG CAGTGATCCG ACTGGTGGAA ACACAATCGA CAACTTCGGG CCGTGCTGCC
GCCCGCGATA TTGCCTCTCG TTTTGAAGTG GGAGGCAAGG CTCTTTCGTT AAACGATAAC
GATATTCAAT TTGGCAGATC GACTCGGCAG TCGAATGGCA GCTATTCGTT TGCGATCAAT
GGCACACCGA CGAATGCAGC GCGAGTTTTT GGTCGCAAGA CCAAAACATC GGCAGCGGGG
CCGGTGGAAC TTCCCTTTGG TGGTTTTGTC GGAGCTCCTG AGTATTCGAC AGAACTCAAT
GCCGTCGCTA TGCGGCTGGA CTATGACATT GTCATCGTGC TCGATCGATC AGGCTCGATG
GGTTGGGATC TATCGGGAGT TGAGTTCGAA TATCCTGAAG CTGTCCGACA AAGACCACTG
GTTGAAAACT ACTTCAGCCC GCCTGATCCC ACAGGAAGCC GATGGGCAAT TCTTTCAGCC
AGTGTGAATG ACTTTTTGAC GATTTTGAAT CAGCGTCAGG TAGCGGCTCG TGTGGGGCTC
GTGACTTATG CTGGCGACTA CACATTCGGT AAGTACAGCT CCGTCAAACT GACTGTGGAA
AGTGATCTTA CTTCCACCTT CTCAACGATT ACATCGAAAT TGACAGCTAT CGGGCAGGTA
CCACTCATTG GCGGGACAGA TATTGGTGCC GGGATTACAG CCGCTCAGAC GATGCTGACG
ACATCCAGCC AGGCTCGCCT CAAGACGGGC CAGCCGATCA TCATTGTCTT CAGCGATGGG
ATGTTTAATC AGGGGACAGA ACCTGTCAGT CTGGCAGCGA GTGCCTATTC GCAATCATCC
ACAATTATTC ATAGCGTGAC TTTTGGAGCT ACGGCTCAAG GTCGTGCCAC GATGAACTCT
GTGACGGCCA CTGCCGGCAA AGGCTTGAGC CTGCATGCCA ATACTGCTGC CGAACTGGCG
GAAAGTTTCC GATCGATTGC CAACGCGATT CCTATTGTGG TGACTGAATG A
 
Protein sequence
MRKNSSGHRR VIHKPHPRRG AIAILAAFVM VALLALAGFF LSLSYVELTR AELRAATDAA 
ARSAVIRLVE TQSTTSGRAA ARDIASRFEV GGKALSLNDN DIQFGRSTRQ SNGSYSFAIN
GTPTNAARVF GRKTKTSAAG PVELPFGGFV GAPEYSTELN AVAMRLDYDI VIVLDRSGSM
GWDLSGVEFE YPEAVRQRPL VENYFSPPDP TGSRWAILSA SVNDFLTILN QRQVAARVGL
VTYAGDYTFG KYSSVKLTVE SDLTSTFSTI TSKLTAIGQV PLIGGTDIGA GITAAQTMLT
TSSQARLKTG QPIIIVFSDG MFNQGTEPVS LAASAYSQSS TIIHSVTFGA TAQGRATMNS
VTATAGKGLS LHANTAAELA ESFRSIANAI PIVVTE