Gene Plim_1791 details

Gene Information

Locus tag: Plim_1791
Symbol:
ID: 9138492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 2322880
End bp: 2324157
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: NHL repeat containing protein
Protein accession: YP_003629820
Protein GI: 296122042
COG category:
COG ID:
TIGRFAM ID:
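
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2322880
END_BP = 2324157


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1278, matching "Gene Length"
print(protein_length(length_bp))  # 425, matching "Protein Length"
```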


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.326453
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCATGA TCTCCGATAG GCAGTCAATG ACGATCGAAA ACGTGCAGAA TTCAAATCGG 
GCCCAGAGAT CTCATCGCGC TCACCGGCGG TCTCTGCTTT CCGTTCAAAG ACACCGCACT
GTCACCATGC GTTCCGTCAT AATGCGCCCC GGCCAAACGA GCTTTGGGCC ATCAAACCTA
ATGCGGCTGG CTGCACTCTG TGTGCTCCTG AGTCTCTTTG CCTTCGCCGA ACTCAATGTC
GCCCATGCTG GCGAAGTCAA AACAATCTTT GGATCAGGCA AAGATGGATT CAATGGCGAT
CAGCAGCCAT TCCTCGAAAC TCACAGCGGC CAGCCGTTTG GACTCGTGAT TGGGCCGGAT
GGTGCTTTGT ATTTCTGTGA GTACACAGGT CACATCATTC GCCGCCTCGA TCTGGAAAAG
CAGACTGCGA CAACCATTGC CGGGACTCCT GGCAAAAAAG GATTTGCCGG TGACGGCGGC
CCGGCGACAA AAGCCTTGAT GAACGAACCC CATGAACTCC GTTTTACTCC TGCCGGGGAT
ATCGTCATTG CCGATATGCG CACGCATACC ATCCGCAAGA TTGATGGCAA AACGGGCATG
ATTTCCACAC TGGCAGGCAC AGGAACCGCC GGATTCAGTG GTGATGGCGG GCCAGCCGAA
AAAGCTCAAT TGAATATGCC ACATTCCATT CAGATCGATC CGGCTGGCGA TCTGTTGATC
TGCGATACCG GGAACCACCG GGTTCGCAAA GTCGATATGA AAACGGGCCT GATCTCGACC
GCTTACGGAA CTGGCGAAAG GAAACCTGCC AAAGATGGTG ATCCGCAGGT GGGCACACCC
CTCAATGGCC CGCGCAGTAT CGACTTCACT CCCGAAGGAG ACATGATTCT CGCGCTTCGC
GAAGGGAACG CGGTCTATCG CTTTCCCAAA GGAGAAGCCA AACTCATCCA CATTGCTGGT
GTGGGTGGTA AGCCATCTTT AGTCGGTGAC GGGATTGATG CCCGCAAAGC CATTCTCGGT
GCCCCCAAGG GAGCGGCTGT CGATGCTAAT GGCGACATTT ATCTCGCCGA TACCGAAACA
CACACGATTC GTGTGATTCG AGCGAAGACC GGCCTGATTG AAACTGTGAT CGGTGATGGC
AAAGCCGGTG ATGGCCCGGA CGGGGAGGCA AAGACCTGCC GCCTCAACCG GCCCCATGGC
GTATTCATTA CCAAGGAGGG CTTACTCCTG GTCGGAGACA GTTCCAACAA TAAAGTCCGC
GTTCTTCCGT TACGATAA
 
Protein sequence
MAMISDRQSM TIENVQNSNR AQRSHRAHRR SLLSVQRHRT VTMRSVIMRP GQTSFGPSNL 
MRLAALCVLL SLFAFAELNV AHAGEVKTIF GSGKDGFNGD QQPFLETHSG QPFGLVIGPD
GALYFCEYTG HIIRRLDLEK QTATTIAGTP GKKGFAGDGG PATKALMNEP HELRFTPAGD
IVIADMRTHT IRKIDGKTGM ISTLAGTGTA GFSGDGGPAE KAQLNMPHSI QIDPAGDLLI
CDTGNHRVRK VDMKTGLIST AYGTGERKPA KDGDPQVGTP LNGPRSIDFT PEGDMILALR
EGNAVYRFPK GEAKLIHIAG VGGKPSLVGD GIDARKAILG APKGAAVDAN GDIYLADTET
HTIRVIRAKT GLIETVIGDG KAGDGPDGEA KTCRLNRPHG VFITKEGLLL VGDSSNNKVR
VLPLR
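
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the sequence is given 5' to 3' in the coding frame. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs mainly in which codons may act as starts). The names `CODON_TABLE` and `translate` are illustrative, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"

# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGGCCATGA TCTCCGATAG GCAGTCAATG"))  # MAMISDRQSM
print(translate("GTTCTTCCGT TACGATAA"))               # VLPLR
```

The two printed fragments match the first ten and last five residues of the protein sequence listed above.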