Gene Plim_4121 details

Gene Information

Locus tag: Plim_4121
Symbol:
ID: 9140841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 5286676
End bp: 5287848
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: von Willebrand factor type A
Protein accession: YP_003632131
Protein GI: 296124353
COG category:
COG ID:
TIGRFAM ID:
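
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and the protein length follows if the final codon is a stop codon. A minimal Python sketch of that arithmetic is given here. The function names `gene_length` and `protein_length` are illustrative, and the inclusive-coordinate convention is an assumption rather than something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(5286676, 5287848)
print(length)                  # 1173
print(protein_length(length))  # 390
```

Both values agree with the Gene Length and Protein Length fields of the record.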


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCTGCCT TGCAGCTCGG TCATCAACCA CATCGCCAGG GGGCCATGCT GGTCCTGGTC 
GCCGTTGTGA TTGTGGCACT TCTGGCCATG ACGATGTTCA CCGTCGATGT GGCCTACATG
CAATTGGTGC GTACTGAACT CCGTGCTGCG ACCGATGCCT CTGCCAAAGC CGGGATGGAA
GCGCTGCGTC GTACTCAGGA TACCGAAGCA GCCATTGACG CTGCCATTGC CACTGCTGCT
GCTAACAAAG TCGGTGGACG ATCTTTGACC CTCACTGCCG ATCAGATCGA GTTTGGACTG
GCTTTTCGAA ATGTGGATAA CTCCGTTTCA TTCAATGCGG GGCAGTTGCC ATATACTGCT
GTCCGCGTGA ACTCAGCGAT GACTGAATCC TCTGCCGCCG GGGCTGTCCC CCTGTTTTTT
GGCAGTATTT TCGGGACGGG CCAGTTCGAG CCGACTCGAT CCGCCGTCTC AGCGAGTACT
GAAGTTGAAA TCTGCTTTGC GATCGACCGG TCACACTCAA TGTGTTTCGA CCTGACGGGT
GTCGATTGGT CTTATCCTCC CGGGACTCCA CGCAATCCAG ATCCCGTCGC ATTTCCTCCG
CATCCCACAC TCAGTCGCTG GGCCTCACTC TCTCGAGCCA TGCAGACATT TGTGAGCATT
ACCGCTTCTC AGGAACCAAA ACCGCGTGTG GCAATGGTGA CCTGGGCCTC CAAAATCACT
CAGTCGAACT ACGAAGGCAA ACTCACCAAA ACCAACAGTC CGGAAGTTTT TGTTGATGTT
CCTCTTACAA CCAATCTGGC CGACCTCAAT CAGGCCATCA AAGGGCGCTC GGAAAAGGTC
ATGCTCGGTG CCACCAATAT GGCTGCCGGA ATCGACGAAG CTCGCAAAAT CCTCAATGCG
ACAAAAAGTA CGCGCCCTTA TGCTCATCGG ATCATCATTC TCATGACCGA TGGTCTCTGG
AATCAGGGGC GTAATCCGCT ACTGGCCGCA CAGGATGCCG CTAACGAAGG AATTGTGATT
CATTCCGTCA GTCTGTTGCC GCGAAGTGGA GATATCACAC CACAGGTCTC CAGCACCACC
GGTGGTGTCA ATTACCCTGC TACCAACAGT GCCGCTCTCG AAGCCGCCTT CGCTGATATT
GCTCGAACTT TGCCCATTGT TCTCACGGAA TAA
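
The sequence is printed in blocks of ten bases, so whitespace has to be removed before any computation. The sketch below, using the hypothetical helper names `clean_sequence` and `gc_content`, shows one way to do that and to compute a GC fraction comparable to the GC content field. Only the first printed line of the gene is used, so the result is not expected to equal the whole-gene value of 54%.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(blocked.split()).upper()


def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if it is empty)."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence if base in "GC")
    return gc / len(sequence)


# First printed line of the gene sequence (60 of the 1173 bases).
first_line = "TTGCCTGCCT TGCAGCTCGG TCATCAACCA CATCGCCAGG GGGCCATGCT GGTCCTGGTC"
seq = clean_sequence(first_line)
print(len(seq))                  # number of bases after cleaning
print(f"{gc_content(seq):.1%}")  # GC fraction of this fragment only
```

Passing the full 1173 bp sequence through the same two functions would give the figure to compare against the record.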
 
Protein sequence
MPALQLGHQP HRQGAMLVLV AVVIVALLAM TMFTVDVAYM QLVRTELRAA TDASAKAGME 
ALRRTQDTEA AIDAAIATAA ANKVGGRSLT LTADQIEFGL AFRNVDNSVS FNAGQLPYTA
VRVNSAMTES SAAGAVPLFF GSIFGTGQFE PTRSAVSAST EVEICFAIDR SHSMCFDLTG
VDWSYPPGTP RNPDPVAFPP HPTLSRWASL SRAMQTFVSI TASQEPKPRV AMVTWASKIT
QSNYEGKLTK TNSPEVFVDV PLTTNLADLN QAIKGRSEKV MLGATNMAAG IDEARKILNA
TKSTRPYAHR IIILMTDGLW NQGRNPLLAA QDAANEGIVI HSVSLLPRSG DITPQVSSTT
GGVNYPATNS AALEAAFADI ARTLPIVLTE
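
The gene begins with TTG rather than ATG. Translation table 11 (the bacterial, archaeal and plant plastid code) accepts TTG as an initiation codon, read as methionine at the first position, while internal TTG codons encode leucine. The following sketch builds the table 11 codon map from the standard NCBI amino acid string and translates the first printed line of the gene. The function name `translate_table11` and the choice to stop at the first stop codon are illustrative assumptions, not a description of how the record was produced.

```python
from itertools import product

BASES = "TCAG"
# NCBI table 11 amino acids, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "TTGCCTGCCT TGCAGCTCGG TCATCAACCA CATCGCCAGG GGGCCATGCT GGTCCTGGTC"
print(translate_table11(first_line))  # MPALQLGHQPHRQGAMLVLV
```

The output matches the first 20 residues of the protein sequence listed above, including the leading methionine and the internal TTG read as leucine at position 4.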