Gene Plim_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1400 
Symbol 
ID9138095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp1780540 
End bp1781736 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content55% 
IMG OID 
Productdomain of unknown function DUF1745 
Protein accessionYP_003629433 
Protein GI296121655 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.653815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGCCT TTATGAACCG ATATGCAGCC GCCTGGACGA CAGAAGTTTC ATTAGTTCGT 
GCGATGGAAC AGGTTGCTAT TGAGATTCAA AGCCAGCTTG AGGGGCGACA CCCCGATTTG
CTGCTGGTGT TTTGCTCCCA CCATTATGCG GATGCCTGGC AGAATCTGTC GGCGGGGCTC
GTCTCCACAA CAGGTGCCAA AGTCCTGCTC GGCTGTTCGG GCGAATCGAT TGTGGCCACA
GGCCGGGAAC TTGAAAATGG ACCGGCACTT TCGATTTGGG CGGCCTCGTG GGACGGCGTG
GGAATGATCC CTTTTCAGGC AACGTTTGAA CGCACGCCGG ATGGCATCGT CACTACGGGC
TTACCACAGG GAGTCAATGG ACTTCTTCAG GGGAATGCTC GCTGTGCGAT CGTACTGGCC
GATCCGTATT CCTCATTGAC AGATCTGATC ACAGATCATC TGGCAGAGGA TTTGCCGAAC
CTGCCCGTCA TTGGTGGTAT GGCCAGCGGC GGTGGACCGG GCGAAAACCG CCTGTTTTAT
GCTCACAAGG CAATTGAACC GCAGGTTTTC GAAGAGGGGG CGATTGGAGT CATTCTCTCG
GGAAATCTGA CGTTTACACC GGTGGTCTCA CAAGGGTGTA AACCGGTGGG AACAACCTAT
GTGGTGACGA AGGCTGATCG AAACTTTATC GTGGAACTGG GTGGTGAACC TCCACTGGCC
CGTCTGGAAC AGCTTTACGC TGACTTATCC GCCACTGACC AGAGGCTGAT CGAAAACGGC
CTACACCTGG GATTGGCCAT GACCGAGTAT CGCGATCAGT TCCGCAGGGG CGACTTTCTG
ATTGCCAATG TGATTGGTGC TGATCGTAAT ACCGGAGTGC TGGCCATTGG CGGAAAAGCA
CGAGTCGGCC AAACCGTGCA GTTTCATCTG CGTGATCATG TGACGGCCAG CGAGGATCTG
GTCGAAATGC TAAAGACTGC CCGTTCCAGC CATCCGGCAC CGCAAGCCGC CCTGCTCTTT
ACCTGCAACG GCCGAGGGAC GCGGTTGTTT TCCGCTCCTC ACCACGATGC CCAAAAACTG
GAAGAATTCT TCGGCTCCAT TCCTGTCGCA GGATTTTTTG CCCAGGGAGA ACTTGGTCAA
GTCGGTACAA AGAACTTCCT GCATGGATTT ACGGCAAGTA TTGGGCTATT TGGATGA
 
Protein sequence
MIAFMNRYAA AWTTEVSLVR AMEQVAIEIQ SQLEGRHPDL LLVFCSHHYA DAWQNLSAGL 
VSTTGAKVLL GCSGESIVAT GRELENGPAL SIWAASWDGV GMIPFQATFE RTPDGIVTTG
LPQGVNGLLQ GNARCAIVLA DPYSSLTDLI TDHLAEDLPN LPVIGGMASG GGPGENRLFY
AHKAIEPQVF EEGAIGVILS GNLTFTPVVS QGCKPVGTTY VVTKADRNFI VELGGEPPLA
RLEQLYADLS ATDQRLIENG LHLGLAMTEY RDQFRRGDFL IANVIGADRN TGVLAIGGKA
RVGQTVQFHL RDHVTASEDL VEMLKTARSS HPAPQAALLF TCNGRGTRLF SAPHHDAQKL
EEFFGSIPVA GFFAQGELGQ VGTKNFLHGF TASIGLFG