Gene Plim_3620 details


Gene Information

Locus tag: Plim_3620
Symbol:
ID: 9140338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 4654986
End bp: 4656473
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: protease Do
Protein accession: YP_003631631
Protein GI: 296123853
COG category:
COG ID:
TIGRFAM ID:
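
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names gene_length and protein_length are hypothetical helpers introduced here.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Protein length in aa, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
start_bp, end_bp = 4654986, 4656473
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the listed Gene Length (1488 bp) and Protein Length (495 aa).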


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTC TCTCTCGATT TGGCGGCAGT CTGGCCACAC TTTCCGCTGG AGCAATTGTC 
GCGGTGGCCT CGATGCAATA CGCAGAACAT ACCCAGGCTG TAGCGGTTCC CCGCGTCCTC
ACGCCGGGCG AGTTTTCGAT GGCGTTTCGT GAAGTCGCTT CGCAAAGCCT GCCAGCGATT
GTCTCGATTC GGACACTCAG CAAAGGGAAC GCGGCTAACG TTGGCAGCCT TCCGGGCGGC
GACGACATGC CACTGCCAGA ATTCTTCAAG AACGATCCCC GCTTCCGTGA CATGCTCAAA
GCACCGCGCC AGCAGCGTGC ACCCATGCAA CGTGGGATGG GGAGTGGCTT CGTGATCGAT
GCCAGCGGGA TCATCATGAC CAATAACCAT GTGGTCGATG GGGCCGACGA AGTGATCGTG
ACGCTTCAGA ACGGTAAGGA ATACGTTGCC AAGGATATCA AAACTGATCC TCGGACAGAC
GTGGCCATTC TGCGGATCGA AGGAGCCAAA GACCTCGTCG CACTTCCTTT AGGTGACAGC
GACTCGGCAC AGCCGGGCGA CTGGGTGATG GCGATTGGTT CTCCCTTTGG ACTCGATACC
AGCGTGACAG CCGGGATTGT CAGCGGTAAA GGCCGTGGGA TGGGGATTAC CGAACGCGAA
GACTTCATTC AGACCGATGC GGCTGTGAAC CCTGGCAATA GTGGCGGGCC GCTGATCAAT
CTGCGTGGTG AAGTGATCGG TATCAACACC GCGATCTCGT CCCGCAGTGG CGGTTACGAT
GGTGTGAGCT TCTCGATTCC AATCAACATG GCGCAGTGGG TCAGCAAGCA ACTGGTGGCC
AGCGGACAGG TCAAGCGGGC TTATCTGGGC ACATCGATTG CCCCTGTGGC CGAATCGATT
GCACTCAAGC TGGGTGCCAA TGCGGGTGAA GGTGTTGTGA TCCAGATGGT TCGCCCCGAT
TCACCTGCTG CCAAGGCAGG CCTCGAACCA GGCGACGTGG TGATCTCGGT CAATGGTGTG
AAAGTGAACG ATCCACGTTC CTTGCAGTCG GCTGTCGAAC GGCTCGATAT CGGCAAGTCC
TACCCGATTG TCGCCAAGCG GCAAGGTAAG GAACTCAACC TGAGTGTGGT GGCCGAAGAA
ATGCCCAGCG ATTTTTCACG CTCGCAACTG GCTCAGTCGG GTAAGCCCAA GAGCCAGTCG
CTGGACAACA TCGGGATGTC GATTGACCGG CTGACCCCAT CCATTGGTCG CCAACTGGGT
GTGACTGGTG AAGGTGTCGT CGTGACTGAA GTGGCGGGTG ATTCTGCCGC TGAAGCTGCC
GGTGTGAAGG TGGGTGACGT GATCGAAAAG ATTGGCGACA AAACGGTGGC TCAGCCCGAA
GATGTGAAAG CCGCTCTGGC CAATGTCGAT CTGAAAGAAG GGGTCATCCT GCACCTGCGA
AATGCGGAAG GCAAGCGATT TGTGATCCTC AAGAGTGTCG ATGAATAG
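
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing length or GC content. The sketch below is a minimal illustration, not the method the database used to derive its GC content field. The helper names clean_sequence and gc_fraction are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block):
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence shown above, used as sample input.
first_line = "ATGAGTTTTC TCTCTCGATT TGGCGGCAGT CTGGCCACAC TTTCCGCTGG AGCAATTGTC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Applying the same two functions to the full sequence block would give the whole-gene length and GC percentage for comparison with the fields listed under Gene Information.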
 
Protein sequence
MSFLSRFGGS LATLSAGAIV AVASMQYAEH TQAVAVPRVL TPGEFSMAFR EVASQSLPAI 
VSIRTLSKGN AANVGSLPGG DDMPLPEFFK NDPRFRDMLK APRQQRAPMQ RGMGSGFVID
ASGIIMTNNH VVDGADEVIV TLQNGKEYVA KDIKTDPRTD VAILRIEGAK DLVALPLGDS
DSAQPGDWVM AIGSPFGLDT SVTAGIVSGK GRGMGITERE DFIQTDAAVN PGNSGGPLIN
LRGEVIGINT AISSRSGGYD GVSFSIPINM AQWVSKQLVA SGQVKRAYLG TSIAPVAESI
ALKLGANAGE GVVIQMVRPD SPAAKAGLEP GDVVISVNGV KVNDPRSLQS AVERLDIGKS
YPIVAKRQGK ELNLSVVAEE MPSDFSRSQL AQSGKPKSQS LDNIGMSIDR LTPSIGRQLG
VTGEGVVVTE VAGDSAAEAA GVKVGDVIEK IGDKTVAQPE DVKAALANVD LKEGVILHLR
NAEGKRFVIL KSVDE