Gene Plim_3164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3164 
Symbol 
ID9139878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4092946 
End bp4094310 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003631178 
Protein GI296123400 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.512918 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAATA CTGGACTGAC ATTCGCAATC GTCGATTCCT GGAGGCTGCC AGCCTGGCTG 
GCCTTTACTG TCTGGGGTAT GGGTTCATTC CTGGCAATTC TGCTGGTGAT CAATCGCGTC
CCGGTCATGT ACAACGTGTT GAATATGGTC GTGCGCTGGA GGAATACGCT CCTCTCGGCC
ATGGCCTTTA CTATGGTCAT TGGCCTGTTG ACAGTCATGC AGGCTTTTGT GAATGGCATG
TACCGCCTGA CGGAAACCAG TGGTCAGCCC GGCAACGTGC TGATTCTGGC TGAAGGTGCC
TTGGATGAAG CCTTCAGCAG TCTCGCCATG GCCGATTCCA ACGATATTGC CAATCAACCG
GGGATCTTGA AGGATGGCGA ACAGCCTCTG GTCAGTTGCG AAACGTATCT CCTCGCCGCC
CAGAGTGTTC CGCAACCTGA AGGGAAAACG AAACGCCGCT TTCTACAGAT CCGTGGCCTC
GTCGATCCTG TCATGGCTGG TCGAGTGCAT GGCATGACTC TCGATTCTGG TGAATGGTTT
TCGGAAGCCG GGATTCGCCC TGACCCGGAT GATGTGAATG CGGCTCCACT CGTCGAGGTG
GTGATGGGAG CTGGCATGGC CACGGAAATG GGGCGAGATC GAGGCACGAT TGACGGCCGT
TCCCGCAAAG AATTCGTTGT CGGCGACCGT TTTGAACTGA ATAACCGCGT GTGGTATGTC
ACCGGGATTC TTAAGCGGAC TGGTTCATCA TTCGATTCAG AGATCTGGAG TAAACAGAGT
GTCGTCGGCC CGATGTTCGG TAAAGAACGC TACACGACCA TGGTCGCGCG GACGGCTGAC
AACGCCAGTG CTCAAAAACT CAAGGAATTC TTTAATACCG AATATACCAA AGCACAATTA
AGTGCCGAAG TTGAATCGGA GTATTACAAG AAGCTGTCGC AGACAAATGA GCAGTTTCTA
TATGCCATTA TTATTGTGGC ATGCATCATG GCGATTGGTG GCAGCTTTGG CGTGATGAAC
ACCATGTATG CGGCTGTCTC TCAGCGAATC AAAGATATCG GTGTCCTGCA ACTTATGGGC
TTCAAGCGCA GGCACATCCT TGTCTCGTTT GTGCTGGAAT CGCTGCTCCT GGCACTGTTC
GGAGGGCTGC TGGGTTGTTT GATTGGCTTG TTCGCCGATG GCTGGACAGC CAACAGTGTC
GTCTCAGGGA GCGGTGGCGG TGGAAAATCG GTCGTTCTGG AACTGACTGT CGATGGTGCT
ACGCTGATTT CAGGACTGAT GCTGGCCATG GTGATGGGCC TCATTGGCGG GTTTCTTCCT
GCGATTTCAG CACTGAGGCT TCGGGCACTC GATGCTCTCC GCTAA
 
Protein sequence
MINTGLTFAI VDSWRLPAWL AFTVWGMGSF LAILLVINRV PVMYNVLNMV VRWRNTLLSA 
MAFTMVIGLL TVMQAFVNGM YRLTETSGQP GNVLILAEGA LDEAFSSLAM ADSNDIANQP
GILKDGEQPL VSCETYLLAA QSVPQPEGKT KRRFLQIRGL VDPVMAGRVH GMTLDSGEWF
SEAGIRPDPD DVNAAPLVEV VMGAGMATEM GRDRGTIDGR SRKEFVVGDR FELNNRVWYV
TGILKRTGSS FDSEIWSKQS VVGPMFGKER YTTMVARTAD NASAQKLKEF FNTEYTKAQL
SAEVESEYYK KLSQTNEQFL YAIIIVACIM AIGGSFGVMN TMYAAVSQRI KDIGVLQLMG
FKRRHILVSF VLESLLLALF GGLLGCLIGL FADGWTANSV VSGSGGGGKS VVLELTVDGA
TLISGLMLAM VMGLIGGFLP AISALRLRAL DALR