Gene Plim_3874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3874 
Symbol 
ID9140592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4981312 
End bp4983648 
Gene Length2337 bp 
Protein Length778 aa 
Translation table11 
GC content56% 
IMG OID 
Productcapsular exopolysaccharide family 
Protein accessionYP_003631885 
Protein GI296124107 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCAGA CACCCCAGGA TTCCGAATTG GCATCATCCT TCGAAACCGA AGGCGAAGGA 
GGCCTTTCGA TTGACTGGAT GAATCTGGTC TTCCGCTACC GCTGGTGGCT GTGTGCAGGT
GTCGTGTTGG GTCTCGGTTT AGGGATCGCT GCCTACATCA AATTGGGGCC GGAGTACATG
GCCATTGGCC AGGTGATGGT TTCCCGCCGC AATGCCGTCC CGCTCAAAGA ACAAAGTGCG
ATTGGCGACT GGGGTGAACG CAGCGAGCAT ATCGCCTTGA TCATGAGTCC CATGATTGCC
AGCGAAGCCG TGCGTCTTGG TCAGCTCGAC AAGCTCAAAA CGTTTGCCGG TGAAAGCGAC
CCCACACAGA TGGTGCTCGA TGACCTCAAG GTCAAACGTG TGGCCGGTCA GGATCGTTCC
TACCTCAACG TTCTCGATGT CACCTTTCAA AGTGCCAGTG CCGACGATGC CCGTCACGTC
GTCAAAGCCG TGATCGATGC CTATGCCAGT TATCTCAATG AATCCCGCAG TGAGAAAACC
ACAGAAGTTT TCAAAACCGC GCAGGAGGCT CACGATACTC TCCAGAAATC CCTGCGCGAG
AAAGAAGAGG CGTATCTCAA GTTTCGCGAG TCGGCTCCTT TACAGTGGAG GGCTCCCATC
GGTGCCATCG CCGCCGATGG CACATCCGGC GCCACCAACG TCCATCAGGA ACGTGTCATC
GCTGTCGAGG AGCAGCGTCG GCTGAATCTG TTGCGACGGA CAGAAATCAA TTCACGCATG
CAGGCGTTAC AGAAAGCCCA GGATCAAGGC GAATCCACCT CCACGATGGA AATGCTCATC
CGCCGCTTCA TTGCCAACGA TGGCTCAGGA GCTGAACTTC AACAGCAGCA ACAGGAAATC
TCCGCCTTCG AGAATCGTTT ACTCCCTCTC ATTCTCGAAG AAGAAAAGTT ACTGCGAGAT
TTCGGGCCCG ATCATCCCGA AGTCAAAGCC GTTCGCAAAT CGATCGATAC CACGCTCGAT
TTCTATCGCA AACATGGTCT CCGCATGCCC GATGACATGA CGCCCGGCCC CGATGGTAAG
CCGCTCATTA AAGAACGTGC GAACCTCGTC CAGAACTACA TGGAGACTTT GCGACAGCAG
CTCAAAGAAC TCGACATCCG CGAACAGCAA CTCAATGTCC TCTTTGAACT CGAATCCAAC
AAAGCCAAAG AATACGCCCG CTTTCAGGCC GAAGATCAGG CCATGACGGC TGAACTTAAC
CGCCTGCGCA GTCTCTGGGG TCAACTCGTC GATCGTATGA GTCAACTCAC CATCGACAAA
GAATCCAGCG GCTACACACT CCGCCAGACA GGCCCGATCA AAGACGAATG GGTCCTCAAA
CGACTGCTCA AAATCGTTGG AGCAGGCATG GTGGCCGGCA TGGGCCTGAT CATCTCTTTG
ATTGCCCTCA AGGAGTTCCT GAGCAATCAG GTGCGTACGG TTCGAGAAGT GCGCCAATTG
CTCCCCGAAG CCTATCTGGG GGCTGTGACC TCTTTCGATC CCGAGCAAAG CCGCCTCGAC
CCGGTACAGA ATGTTCATCC TTCACTCAGA TATCTAAGGT CGCCCGCCTC CATCGAAGCC
GAGAATTACC GCACCATCCG CACTTCGTTG CTCGTCACTG CCGAAGCTCT GAACGCGCAA
GTGATTCAGG TCGGCAGTCC CGAGCCAGGC GACGGAAAGA CCACGCTCGT CTCGAATCTG
GCACTCGCAC TGGCCAGCTC CGGCAAACGA GTCCTGCTCA TCGATGCCGA TCTCAGACGC
CCGATGGCCA CTCGACTGTT TGGCTTAAGG CCCGATCCGG GACTTGTCGA CGTTCTCCAG
GGAGAAATTG CCCTCGAAAA TGCCATTGTC GAAACTGTTG TCGAAGGCTT AACCATCCTC
CCCTCCGGCA GACCGCCGCA TAACCCGGCA GAACTGCTCG AAGGTGGGCC GCTGCGTCAG
TTGATTGCCA AGGCTCGAAC TTTGGCCGAT ATCGTGCTGA TCGATGCCCC ACCCGTCCTC
GCAGTGAGCG ATGCCTGCAT CATCGGCCAA CACGTCGATG GCTATCTGCT CACTGTCCGC
TTAGGGAAAA ACCGTCGCCC CATGCTCAGG CGTTCGCGCG ATCTCCTCCT GGCACATCAC
ATCCCGATTC TGGGTGTGGT TGCCAATGGC GTCGAACCCA GCGACCGCGA GGAAATGGGC
TACTACGCCG ACTACAACCG CAATGAAGCC TTCTTTCCTC AAGACCAGTC ATCGCCGGAG
AAATCATCGA CTCATCGAGA TTTGACCATA TCGGCCAGTC CCGTGAGGCA GCCGTGA
 
Protein sequence
MRQTPQDSEL ASSFETEGEG GLSIDWMNLV FRYRWWLCAG VVLGLGLGIA AYIKLGPEYM 
AIGQVMVSRR NAVPLKEQSA IGDWGERSEH IALIMSPMIA SEAVRLGQLD KLKTFAGESD
PTQMVLDDLK VKRVAGQDRS YLNVLDVTFQ SASADDARHV VKAVIDAYAS YLNESRSEKT
TEVFKTAQEA HDTLQKSLRE KEEAYLKFRE SAPLQWRAPI GAIAADGTSG ATNVHQERVI
AVEEQRRLNL LRRTEINSRM QALQKAQDQG ESTSTMEMLI RRFIANDGSG AELQQQQQEI
SAFENRLLPL ILEEEKLLRD FGPDHPEVKA VRKSIDTTLD FYRKHGLRMP DDMTPGPDGK
PLIKERANLV QNYMETLRQQ LKELDIREQQ LNVLFELESN KAKEYARFQA EDQAMTAELN
RLRSLWGQLV DRMSQLTIDK ESSGYTLRQT GPIKDEWVLK RLLKIVGAGM VAGMGLIISL
IALKEFLSNQ VRTVREVRQL LPEAYLGAVT SFDPEQSRLD PVQNVHPSLR YLRSPASIEA
ENYRTIRTSL LVTAEALNAQ VIQVGSPEPG DGKTTLVSNL ALALASSGKR VLLIDADLRR
PMATRLFGLR PDPGLVDVLQ GEIALENAIV ETVVEGLTIL PSGRPPHNPA ELLEGGPLRQ
LIAKARTLAD IVLIDAPPVL AVSDACIIGQ HVDGYLLTVR LGKNRRPMLR RSRDLLLAHH
IPILGVVANG VEPSDREEMG YYADYNRNEA FFPQDQSSPE KSSTHRDLTI SASPVRQP