Gene Plim_1190 details


Gene Information

Locus tag: Plim_1190
Symbol:
ID: 9137878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 1523401
End bp: 1524771
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: protein of unknown function DUF147
Protein accession: YP_003629224
Protein GI: 296121446
COG category:
COG ID:
TIGRFAM ID:
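The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1523401
END_BP = 1524771


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1371 456
```

Both results match the reported Gene Length (1371 bp) and Protein Length (456 aa).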


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCATCG TGGATCTGGC AGACCTGGTC AGTGGAATCC GGATTGTCGA GTTGACTGCC 
AAAACGCGTC AGGGAGCGAT CCGCGCTCTG GTTCAGGCCG CCAACTGGGA TGATGATGGA
ATCAATCCCG AAAATGTTCT CGAAGCCATC GAGGAACGTG AGGCCGCCGC ACAAACTCTT
GTGGCTAATG ACTTTGCATT ACCACATGCA TTTATCGACT GGGATGGTGA CTTCCGCATC
GTGCTTGGTC GCAGCAAAAC CCGTGTCGAT TATGGCGGGC CTGCGGGCGT TAATGTCCAA
CTGATTGTGC TCCTGGTCAT TGGGCGCCGA CTGCAACAGA CACATGTGGA AGTCCTCGCT
GCTCTTGCCG AACTGCTGAA AAGTCCCGAC TTCCGACAAA ACCTGATTGA CGCCAAAGAC
ATCAAGGCCA TCGATCTGCT CCTGATGACA CAAGCGGGCA TTCAACCCGA GAATCGTCCC
GTCCGTGGGC CGAGCATTCC CCGCCTGACT GTCAATATGG TGAAGACTGC CATTCAACTG
ACAGAAAGCC TCGCCGCCCA GGCTTTGCTT CTGGCGGTCG AACGAGTGGA GAACGTTCCG
TGGGAGGCAC TGGCCAACTA CAAGGGCCGC CTGCTCCTGG TCACTTCGCA ACACTCTGAA
GAGTTCGACC ACAAGCGAGA AGATCTGCAT ATTTTTGATG TGGCACACGC ATCGCTTTCC
CGTGCAGATC GCGCCAATCT GGGCTTGCTA TTAGCTGCAT CCAGCGGTTT ACTCACCGAA
AAAAACAGCG TGGTCTGTGT GACAGGCCCC GATGGTCGCC GGCTCGACAG CATCAATGTC
ACAAAACCGG AAGTCCATTT TCGGGCGATG CTCTCAGAGA AAAATCGACG TGGTGCCGAT
GTCGTCCGTC CCGCCGTCAT GCTGCGCGTC TTGACTCTCG CGATCGAGAT TGCCGCCGAA
GGTCGCGAAG CGCACCCTAT TGGTGCCTTG TTCGTGATTG GTGATTCCCG TCAGGTACTA
AGGCACTCAC AGCAACTCGT GCTCAACCCT TTTCATGGTT ATGCCCGGCA ACTGCGAAAT
GTCCTCGATC CGAGCCTGGC AGAAACGATG AAGGAGTTCG CTCTCATCGA CGGGGCTTTT
ATCATTCAGG GCGACGGAAC TGTGCTTTCT GCGGGAACCT ACCTCACTCC CAAATCGGCT
TCGGGAAGTG TGGCCCATGG GCTGGGGGCA CGTCATCAAA CCGCAGCCGC CATCTCGGCT
CACACACAAG CGATGGCGAT CACAGTCAGC CAATCCACCG GGACAGTCAC CGTCTTCCGC
AATGGGAGCT CTGTTCTCTC ACTTGAGCGT TCCGGTCGCA CCAAGTGGTA G
 
Protein sequence
MIIVDLADLV SGIRIVELTA KTRQGAIRAL VQAANWDDDG INPENVLEAI EEREAAAQTL 
VANDFALPHA FIDWDGDFRI VLGRSKTRVD YGGPAGVNVQ LIVLLVIGRR LQQTHVEVLA
ALAELLKSPD FRQNLIDAKD IKAIDLLLMT QAGIQPENRP VRGPSIPRLT VNMVKTAIQL
TESLAAQALL LAVERVENVP WEALANYKGR LLLVTSQHSE EFDHKREDLH IFDVAHASLS
RADRANLGLL LAASSGLLTE KNSVVCVTGP DGRRLDSINV TKPEVHFRAM LSEKNRRGAD
VVRPAVMLRV LTLAIEIAAE GREAHPIGAL FVIGDSRQVL RHSQQLVLNP FHGYARQLRN
VLDPSLAETM KEFALIDGAF IIQGDGTVLS AGTYLTPKSA SGSVAHGLGA RHQTAAAISA
HTQAMAITVS QSTGTVTVFR NGSSVLSLER SGRTKW
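The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes the codon assignments of translation table 11 (identical to the standard code for elongation codons) and treats an ATG, GTG or TTG in the first position as an initiator read as M. That start-codon set is a simplification, and the helper names are illustrative. The sketch also shows how a GC percentage could be computed from the space-grouped sequence text.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only these initiators are handled; table 11 allows a few more.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str, starts_at_initiation: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and starts_at_initiation and codon in ALT_STARTS:
            residue = "M"  # initiator codon is read as methionine
        else:
            residue = CODON_TABLE.get(codon, "X")  # X for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("GTGATCATCG TGGATCTGGC AGACCTGGTC"))  # MIIVDLADLV
```

The gene begins with GTG rather than ATG, which is why the initiator rule is needed to reproduce the leading M of the protein sequence.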