Gene Plim_3001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3001 
Symbol 
ID9139713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp3884609 
End bp3885706 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content55% 
IMG OID 
Productvon Willebrand factor type A 
Protein accessionYP_003631022 
Protein GI296123244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCC CTCAAAGAGC TGTGACCGAA CCAGTAATTC AGGTCTCACA GGGAGCAGCC 
AGACAAAGTG CGCTGGTAGG CTGGCTCCTT TCCGGCTGTC TCCATCTGAT CGCTATCGTC
CTGCTCCTCT TTGTCATTCA AGGACAAAGT GTTGAACGCG TTGGTTTCGG CAATGGTGCT
GAAGGTCCAC CCGGTCTCGT AATGGTGGAT GCCAGTGATG GTGGGTTGAC GCAAGAAGCC
GCACCATCAG AAATTGGAAA TGGGAATCGG CCATTACGAA CGGCAGCAGG GAGCAGCGGG
ACTGGTCGCG AAGTTCAATC GACAGAACTC CCGGCTGATG TACCACCAGT TCCACTCTCC
TTGCCGAAAG CTGGTGCAGC AGGACTGGGT TCTGCCCGCG GCAACCTCGC CTCAGAGTTC
TCTCCCACGG CGGGAACTGG CAATAGTGGA ACCGGCCTGA CAGGGACAGG AACCAGTGGT
GATCTGCGGG ATCTGATCGA AGGGACAGGA ACACGAAAAC CGGGAACTGG CCTCGGTGCT
GCCACACCTG GCACCAGCTT CATGGGGATC AAAGATCAGG GAAGCCGGGT CGTGTTTGTG
ATCGATTGCT CGGGAAGCAT GACCAACTAC AACGCGATGC GAGTCGCGAA GACGGCACTG
GTCAGCAGCT TGCAGGCACT TGATACCGGG CAGCAGTTCC AGATCATTTT TTACAACGAC
AGCCCGACGT TTTTGAAAGG GACCAGCCGC GACGGAAAGG CCAGCTTGTG GTTTGCAACC
GAGATCAACA AAACTCTGGC AACACAGCAA ATCAGTGCGG TTCAGCCCGA CCGGGGAACA
CAACATCTGC CGGCACTGAA GCTCGCTCTC AAGTTTTCTC CTGAGGTGAT CTATTTCCTC
ACAGATGCGG ATGAACCCGA GTTAACATCG ATTGAGCGGA AAGAACTGAT TCGGTTGAAT
CAAGGTCGCA GCCGCATTCA TACGATTGAG TTCGGTCAAG GGCCGGAGTT GAAGACGGAG
AACTTCCTCA AGAAAGTCGC TCGCGAGAAC GGAGGAAGCT ATCGATATGA AGATGTCACC
CGCTTCACTT CCCGATAA
 
Protein sequence
MTTPQRAVTE PVIQVSQGAA RQSALVGWLL SGCLHLIAIV LLLFVIQGQS VERVGFGNGA 
EGPPGLVMVD ASDGGLTQEA APSEIGNGNR PLRTAAGSSG TGREVQSTEL PADVPPVPLS
LPKAGAAGLG SARGNLASEF SPTAGTGNSG TGLTGTGTSG DLRDLIEGTG TRKPGTGLGA
ATPGTSFMGI KDQGSRVVFV IDCSGSMTNY NAMRVAKTAL VSSLQALDTG QQFQIIFYND
SPTFLKGTSR DGKASLWFAT EINKTLATQQ ISAVQPDRGT QHLPALKLAL KFSPEVIYFL
TDADEPELTS IERKELIRLN QGRSRIHTIE FGQGPELKTE NFLKKVAREN GGSYRYEDVT
RFTSR