Gene Plim_1056 details

Gene Information

Locus tag: Plim_1056
Symbol:
ID: 9137742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1340768
End bp: 1342420
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003629095
Protein GI: 296121317
COG category:
COG ID:
TIGRFAM ID:
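
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither is stated in the record, although both are consistent with its numbers. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1340768
END_BP = 1342420


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1653
print(protein_length(length_bp))  # 550
```

Both results agree with the Gene Length and Protein Length fields of the record.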


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATC GATCTTTTCC AGGGTCTTCC GGGGAGTTCA CTTCCCTCCA GGAAAGCCTC 
ACCGAAGATC AGTTGCTTGA GGCCGAAGCC TGCATTGCCA ACGGAGAGAT GGCAAAAGGC
CAGTCGCTGT TTCATCAGCT CCAGAAAGAT CTTCAAGACT CAAGATCTCA GGCACGAATC
TGGAATGGAC TGGGTGTGAT TGCTGCACTG CAGTCGGATC ATGCGACGGC GATCGATAGT
TTTGAACAGG CTCTCCGCTG TGATCCATCC TGGAAGATAC CGATTGTCAA TCGAGACCGA
TGGTTGGCGT CAGACACCGA ACGGTCGAGG GCGAAAGAGA CAAGTTGCCC CGACACCTGG
ACCACTGAGC AAAATCACTC ATTACCCCGC ATCGCCATCA TCAGTCTGCT CTTTAACTGG
CCCTCCACCG GTGGTGGGAC CGTGCATACT GCGGAACTGG CTCGATTCCT GCATCGAGCA
GGTTATCCCG TTCGGCATTT CTCTTTTCGT TACGATGCCT GGCAATTGGG TCAGATCCAG
GAACCTGTGG ATTGGCCTCT CGAAGAATTG CTGATGCAGG CTTCGGATTG GAATGCTCTC
CAGATTCAGC ACCGATTGCA GCAGGTACTT CGCCAGTTTT CTCCAGACGC TGTGATTGTG
ACTGATAGCT GGTCCATGAA ACCCATTCTG GCAGAAGCGT GTGAGGGATA TCCCACTTTT
CTACGCCTTG CCGCTCAGGA GTGCCTGTGC CCGCTCAACA ATGTTCGGCT GCTCCAGATT
GCACCTGTCC CCCGCTCCTG TCCCAGACAG CAACTGGCGA CACCAGAGAT CTGCATCCAG
TGCGTCACTG AGAATGAGCG TTTCTCGGGA GGACTGCATC GCCGGGAGCG TGCGCTGGCA
GGCTTTGGGA CCCCTGAGTA CAGCCAGCGG CTGCGCCGGG TCTTTGCCCA GGCAACGGGA
ATTCTGGCAG TCAATCCGCT GATTGCCGAA GCCTGTAAGC CGTTCGCTCA AGCAGTGCAT
GTGGTTCCCA GCGGGTTTGA TCCGGCCCGT TTTCCGATCG CACAGACTGC GCCTCGCCAC
CCCGAACATG TCGTGCGACT GCTCTTTGCA GGCCTGGTTC AGGAACCCAT GAAAGGGTTT
CATGTGCTTC TGGAGGCGGC TCGTCAGCTC TGGCAGCATC GACAGGACTT CCAATTGGTG
GTCACCGATG AGCCTCTGGA TCTCGCAGAT CCTTTTGTGA AGTTTGTGGG ATGGCAATCT
CAGGCCCATC TGCCACAGGT CATGCAATCG TGTGATATCG TGATCTGCCC CACCATTGCG
GAAGAAGCAC TTGGTCGAAC GGCCGTGGAA GGGATGGCGG CAGGTCGTCC TGTCATCGCC
AGTGCCATTG GCGGATTGTC TTTTACCGTT CTGGATGAGG CGACGGGACT CTTGTTTCCG
CCGGGAGACA GCCCTGCTCT TTCGAGACAA ATTGAACGGC TTTTGGATGA TTCCTCGCTT
CGAAGAAGCC TGGGAATCCG CGGTCGCGAA CGATTTGAAA AAGAATTTAC GTGGAATTCG
ATACTCGATC GGCATTACCG CCCATTATTC GAAAGTACTC GCAGGGAATT TCAAAATCGA
GCCCAGGGCC TTGTCTCATC GGAGGGTGTA TGA
 
Protein sequence
MNDRSFPGSS GEFTSLQESL TEDQLLEAEA CIANGEMAKG QSLFHQLQKD LQDSRSQARI 
WNGLGVIAAL QSDHATAIDS FEQALRCDPS WKIPIVNRDR WLASDTERSR AKETSCPDTW
TTEQNHSLPR IAIISLLFNW PSTGGGTVHT AELARFLHRA GYPVRHFSFR YDAWQLGQIQ
EPVDWPLEEL LMQASDWNAL QIQHRLQQVL RQFSPDAVIV TDSWSMKPIL AEACEGYPTF
LRLAAQECLC PLNNVRLLQI APVPRSCPRQ QLATPEICIQ CVTENERFSG GLHRRERALA
GFGTPEYSQR LRRVFAQATG ILAVNPLIAE ACKPFAQAVH VVPSGFDPAR FPIAQTAPRH
PEHVVRLLFA GLVQEPMKGF HVLLEAARQL WQHRQDFQLV VTDEPLDLAD PFVKFVGWQS
QAHLPQVMQS CDIVICPTIA EEALGRTAVE GMAAGRPVIA SAIGGLSFTV LDEATGLLFP
PGDSPALSRQ IERLLDDSSL RRSLGIRGRE RFEKEFTWNS ILDRHYRPLF ESTRREFQNR
AQGLVSSEGV
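
The gene and protein sequences above can be related through the translation table listed in the record (table 11). The following is a minimal sketch, not the database's own method. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code, and it ignores table 11's alternative start codons, which do not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon-to-amino-acid assignments for NCBI translation table 11,
# listed in the conventional TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAACGATC GATCTTTTCC AGGGTCTTCC"))  # MNDRSFPGSS
```

The printed residues match the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` gives values to compare with the Protein sequence and GC content fields.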