Gene Plim_2839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2839 
Symbol 
ID9139551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp3674785 
End bp3676356 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content50% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003630860 
Protein GI296123082 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCTGGT TAATTGTTGC CGGGTTGATT GCTGTCACGT TGCTCACATT TGTGATGACA 
ATCGCGCAAC TCGACCTGTA CCGTCGCTAC TGGCTCTCGG TACTTCGCAA AGATCGCCAG
CGCGAGGTAC CACCCCTTCC TGAGTCACTA CCGCGAGTGA CCATACAACT CCCCATTTAC
AATGAGTCTC CCGTCGTTCA TCGACTCCTC GAAGCCGCTT CACGAATTGA CTATCCTCAT
AATCTCTTAC AGATTCAGGT GCTTGATGAC TCGACAGATG ACTGCTCGAA AATTCTCGTC
GACAAAGTCG CCGAGATTCA ACAGCGAGAT CCCAGCCTGA ACATCCAGTA TCGACATCGC
ATCGATCGTA CAGGTTACAA AGCCGGAAAT CTGGATGAAG GGACCACCTG GGCGACAGGT
GAGTTCATGG CCATTTTCGA TGCTGATTTT GTACCGAAAC CAGACTATCT TCAGCAGACC
ATCCGCTACT TCCAGAACGA AGAAATTGCT ATCGTTCAAA GTCGATGGGG ACACCTGAAT
CCTGACTCGT CAATTGTGAC TCGAGTTCAG CAATTCTTTC TGGATGGACA TCTTTCGGTC
GAGCAGAGAG GGCGAGGCGA TAGCGATCTG TTTTTGATCT ATAATGGATC CGCTGGCATC
TGGCGAAAAC AGGTCATCGT CGATTGCGGC GGCTGGATGA CAACGGCCGC CATTGAAGAC
GTGGATATGA GTTATCGAGC CCAGTTGCGC GGAAAAAAGA TTGTCTATCT CGAAGACTAC
ACGACACCAG GTGAGTTACC CGATTCAATG ATCGCCCTCA GGCTGCAACT CTTCCGCTGG
TGGAAGGGAA ATCTGCAGAT CGCTATTAAG TATATTCGCC AGGTCTGGCA AAGTGATTAC
CCGCTTATCA AGAAGCTACA TGCCACGACA CACCTGTTTG GCCCTCTGAT GTCCGCAGTA
ACATTTGCGA ATATTATTCT CGCAGGGGCT GTGCCGCTGA TTGTGACGTG GTACCCGGAA
ACTCGCTATT GGTTGGCATC GACACTACTG GGGGTAGCAC TCATTCCCGT CCTGTTTCTA
GTTTACGGCA CCGGAAGAAT CCGTTTTGGT GAAGGCAGTC GTTGGCAAAA GATATTGGGT
ATCATTCCTT TGGGAAGCAT GCTGATGGTG CTGCATTCCG GACTATCGTG TCAGCATACG
GTCTCTGCTT TTGAGGCTTT TTTCGTTAAG AAGAATGTCT GGGTCGTCAC TCCCAAAGGG
TTTTCCAGCA CAGGCGCAAC ACAGGCAAAA CGTCGCCGCA TCAAGATTCC ATGGTATTTC
TGGCTGGATG CACTAGTCAT TATCTACCTC ATCGGCTGCG GCTGGATGGC CTTGATGTTT
CAGTTTTATA TGATTGCGGC CCTGCAGGTG CTTTGGATCT GTGGCTTTCT TTGGGTTCTG
GGTGGATCTT TGTGGGAGGC GAACAAAGAC CAGAGAGCCT TCTCTTTGTC ATCTACCCAA
AAAGATCGTT TAGAAAACAC AGCGGCAGAA CTCACACCTG GCCCACTGGA GAGTGCCAGT
CTAGCGTCTT GA
 
Protein sequence
MVWLIVAGLI AVTLLTFVMT IAQLDLYRRY WLSVLRKDRQ REVPPLPESL PRVTIQLPIY 
NESPVVHRLL EAASRIDYPH NLLQIQVLDD STDDCSKILV DKVAEIQQRD PSLNIQYRHR
IDRTGYKAGN LDEGTTWATG EFMAIFDADF VPKPDYLQQT IRYFQNEEIA IVQSRWGHLN
PDSSIVTRVQ QFFLDGHLSV EQRGRGDSDL FLIYNGSAGI WRKQVIVDCG GWMTTAAIED
VDMSYRAQLR GKKIVYLEDY TTPGELPDSM IALRLQLFRW WKGNLQIAIK YIRQVWQSDY
PLIKKLHATT HLFGPLMSAV TFANIILAGA VPLIVTWYPE TRYWLASTLL GVALIPVLFL
VYGTGRIRFG EGSRWQKILG IIPLGSMLMV LHSGLSCQHT VSAFEAFFVK KNVWVVTPKG
FSSTGATQAK RRRIKIPWYF WLDALVIIYL IGCGWMALMF QFYMIAALQV LWICGFLWVL
GGSLWEANKD QRAFSLSSTQ KDRLENTAAE LTPGPLESAS LAS