Gene Plim_2950 details


Gene Information

Locus tag: Plim_2950
Symbol:
ID: 9139662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 3817303
End bp: 3820443
Gene Length: 3141 bp
Protein Length: 1046 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: protein of unknown function DUF1549
Protein accession: YP_003630971
Protein GI: 296123193
COG category:
COG ID:
TIGRFAM ID:
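
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Plim_2950.
length = gene_length_bp(3817303, 3820443)
print(length)                     # 3141
print(protein_length_aa(length))  # 1046
```

Both results agree with the Gene Length and Protein Length fields of the record.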


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.798695
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTTTG ACCTGCGCGG CAACAGACTT TCTGGCAAGT TATTGGCGGC TGTGTGCCTG 
GTAACAGCTC TGGAAATGAC CATTCCGCAG CTTGCCAGTG CCAGTGAAAA GCTGCAGTTC
AATCGGGATA TTCGTCCGAT CCTGTCGAAC AGTTGTTTTC AATGTCATGG CCCGGACAGT
GCGAAGCGGG AAGCCGGGCT GAGGCTGGAT CAGCGGGCAG CGGCTGTCGC GCAAACTGCT
TCCGGAGTGC AGCCGATTGT CCCGGGCGAT CACGCTGCCA GCGAGATCAT CTCGCGGATC
CTCTCAGATG ATCCCGATCT GAAAATGCCA CCGCCTGAGA GTGGCAAAAC AGTGACTCCC
GAACAACTGG CGACACTCAA GCGCTGGGTG AGTGAAGGTG CGGAATATGA ATCACACTGG
TCGTTCCTGC CGATTCAAAG GCCTGCTGTT CCCGCTGTCA AAAATGACAG TCGAGTGCGT
ACTCCGATTG ATCAGTTTCT ACAGGCCCGT CTCGAAAAAG AGGGTTTGGG CTTAGCCGAT
GATGCTGATC GGGTGACGCT CATCAGGCGG GTCACGTTCG ACCTGACGGG TTTGCCTCCC
ACACCGGCTG AGGTCGAGGC TTTTGTGACG GATCCTGCGC CAGATGCCTA TAGCAAAGTG
GTCGACCGGC TGCTGGCTTC GCCGCGTTAT GGAGAGCACA TGGCCCGCTA CTGGCTGGAT
GCCGCCCGCT ATGGGGATAC GCATGGTCTG CATCTGGATA ACGAACGTTC GCTCTGGCCT
TATCGCGAGT GGGTCATCAA TGCCTACAAC AATAATCTGC CGTTTGATCA GTTTACGATT
GATCAGCTGG CGGGAGATCT TTTGCCAGCA CCGACGCTCG ATCAGCGGGT AGCGACGGGG
TTTAATCGCT GCAATGTGAC GACGAGTGAA GGGGGGGCGA TCGACGCCGA ATGGTATGTC
CGTTACGCCA TTGATCGTAC GGATGCCGTC GGGACTGTTT GGATGGGTCT GACACTGACA
TGTGCTTCGT GCCATGACCA TAAATTCGAC CCGATTTCCC AGAAGGAGTT TTACTCACTC
TATGCCTTCT TCAACAGCTT GAGTGATCGG GCCATGGATG GAAATGCGTT GCTTCCACCA
CCTGTGCTCA AATTGCCGAC GGCAGATCAA GAGGCTCGAA TGGCGGAGCT CAACAAAGAA
ATCGCGACCT TAAAACAGCA ACTGGCGCAG AAGCGCAAGG AGACTCCTTA TGTCGAGCCC
TCGCTCGATG CCCCGGCTGA TGCATTAACT GTGACCCGCA GGGACTACAT CTGGATCGAT
GATGTTCTCC CGCCAGGTGC CCAGCCCCAG TCGAACACGG GGTCGTGGAA GTTTATTTCG
GCTGCTGAGG GCCCGGTTCT CGCTGGAAAT CTGGCATCAA CGCGTACCTC CGAGGGGCTG
GATCAACATT TCTTTACCGG GGCGACTTCC CCACTTGCGA TTGGTGAAGG AGACGAGCTC
TTTGCGTATG TCTATCTCGA TCCTGCCAAT CCGCCAAAAA CGATCATGCT CCAGTTCAAT
GACGGCACAT GGGAGCATCG CGTTTACTTT GGCGAAGATG CGATCCCTTA TGGCAAGGCA
GGAACTGTGG GCCATCGACA CGGTGGGCCC TTGCCACCCG TCGGACAATG GACACGAATT
TCTGTCAATG CCCAGCATGT GGGCCTGGCT TCGGGTGCAA AGCTGAATGG CTGGGCCTTT
ACGCAGTTCG GTGGAACTGT TTATTGGGAT AAAGCCGGCA TTCGCACGTT GACACTTCAG
AACGGTGAAT CGTTTAATTC GCTATTGGCA TGGGAAAACT ATCAGAAACA ACAGAAGAAG
CCCGCACTTC CGGACAATAT CGCCAAGCTG ATCAAGATCG AATCTGACAA GCGGAATGCT
AATCAGCAGA AACAGATTGT CGAGTACTTT GTCGAGTATG TGCATCCGGA GACTCGCCAG
CTCTTTGAGC CGCTGAATGG GCAGATTGCC AAGCTCCAAA GCGAAATCGA TGCTGTCGAA
AAACAGATCC CAGCCACGCT CGTGAGTGAA GATATGGCAC AGCCACGTGA GGCTTTTGTG
CTGATTCGGG GGGCTTATGA TAAACCGGGC GAGAAGGTCT CCCGGGCGAC ACCAGCAGCA
TTGCCATCCA TGCCCGAAGA CTTGCCACGC AATCGGCTGG GGCTGGCCAA GTGGCTGGTT
TCGCGAGAAC ACCCACTCAC TTCGCGAGTG ACGGTGAACC GGATCTGGCA ACAGATTTTC
GGTATCGGGA TTGTGAAATC GAGCAACGAT TTCGGATCAC AAGCTGAATG GCCGACGCAT
CCCGAACTGC TCGACTGGCT GGCCTCCGAG TTCATAGAGT CTGGCTGGGA TCACAAGAAA
TTCTTGAAGC TGATCGTGAT GTCCCAGGCT TATCGGCAGT CCTCCAAAGT CACTCCGGCG
CTTTATGCGA AAGACCCTGA GAATCTGTTA TTGGGCCGTG GTCCACGTTT CCGGCTGGAT
GCAGAAACCA TCCGCGACAG CGTGCTGTTT ACCTCGGGAT TGCTCGTGGA ACGAAAGGGC
GGCAAAAGTG TGCGGCCTTA TCAGCCCTCG GGAATCTGGG AGCCAGTCGC ATTTCAAAGT
AGTAACACGC GGACATATAC CCGCGATCAG GGAGAGGCGC TCTTTCGTCG CAGCCTCTAT
ACCTTCTGGA AGCGGACAGC TCCACCACCA TCGATGACGA CATTCGATGC TCCATCCCGC
GAGACCTGCA CAGTCCGCAG GGCGAGGACG AATACACCCC TCCAGGCGTT GGCGCTGATG
AATGATGATC AGTATGTCGA AGCAGCTAGG CATCTGGCAG GGCAGATGAT CAGTCAAGGG
GGCGAGACTG CCGCCGAAAG GCTGAACTTT GCCTGCCTTC GCGTATTGAG TCGCAATGCG
AAAGCCGATG AGTTGGCAGT CCTTGAGCGA GTTCTGGAAA AACAACTGGA AATTTATCGT
GCGGATAGTA AGGCAGCTGA GGAACTGCTG AATGTCGGCA CACTTCCAAA ACCTGGTCAG
ATGGAAGCAC CGGAGCTGGC GGCTTATACC ATGGTGGCCA ATCTGATGCT GAATCTCGAT
GCGGCGATTA CGAAGGAGTA G
 
Protein sequence
MIFDLRGNRL SGKLLAAVCL VTALEMTIPQ LASASEKLQF NRDIRPILSN SCFQCHGPDS 
AKREAGLRLD QRAAAVAQTA SGVQPIVPGD HAASEIISRI LSDDPDLKMP PPESGKTVTP
EQLATLKRWV SEGAEYESHW SFLPIQRPAV PAVKNDSRVR TPIDQFLQAR LEKEGLGLAD
DADRVTLIRR VTFDLTGLPP TPAEVEAFVT DPAPDAYSKV VDRLLASPRY GEHMARYWLD
AARYGDTHGL HLDNERSLWP YREWVINAYN NNLPFDQFTI DQLAGDLLPA PTLDQRVATG
FNRCNVTTSE GGAIDAEWYV RYAIDRTDAV GTVWMGLTLT CASCHDHKFD PISQKEFYSL
YAFFNSLSDR AMDGNALLPP PVLKLPTADQ EARMAELNKE IATLKQQLAQ KRKETPYVEP
SLDAPADALT VTRRDYIWID DVLPPGAQPQ SNTGSWKFIS AAEGPVLAGN LASTRTSEGL
DQHFFTGATS PLAIGEGDEL FAYVYLDPAN PPKTIMLQFN DGTWEHRVYF GEDAIPYGKA
GTVGHRHGGP LPPVGQWTRI SVNAQHVGLA SGAKLNGWAF TQFGGTVYWD KAGIRTLTLQ
NGESFNSLLA WENYQKQQKK PALPDNIAKL IKIESDKRNA NQQKQIVEYF VEYVHPETRQ
LFEPLNGQIA KLQSEIDAVE KQIPATLVSE DMAQPREAFV LIRGAYDKPG EKVSRATPAA
LPSMPEDLPR NRLGLAKWLV SREHPLTSRV TVNRIWQQIF GIGIVKSSND FGSQAEWPTH
PELLDWLASE FIESGWDHKK FLKLIVMSQA YRQSSKVTPA LYAKDPENLL LGRGPRFRLD
AETIRDSVLF TSGLLVERKG GKSVRPYQPS GIWEPVAFQS SNTRTYTRDQ GEALFRRSLY
TFWKRTAPPP SMTTFDAPSR ETCTVRRART NTPLQALALM NDDQYVEAAR HLAGQMISQG
GETAAERLNF ACLRVLSRNA KADELAVLER VLEKQLEIYR ADSKAAEELL NVGTLPKPGQ
MEAPELAAYT MVANLMLNLD AAITKE
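
The gene and protein sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that, to compute GC content, and to translate the coding sequence. It makes several assumptions. It uses the codon assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons). It does not handle alternative start codons, which does not matter here because this gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order; the amino-acid string
# follows the standard code, which table 11 shares for codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
print(translate("ATGATTTTTG ACCTGCGCGG CAACAGACTT"))  # MIFDLRGNRL
print(translate("GCGGCGATTA CGAAGGAGTA G"))           # AAITKE
```

The two printed fragments match the first and last residues of the protein sequence listed above. Passing the complete gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.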