Gene Plim_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1121 
Symbol 
ID9137808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp1423815 
End bp1425272 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content58% 
IMG OID 
Productprotein of unknown function DUF1501 
Protein accessionYP_003629155 
Protein GI296121377 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTG AAATTGATCC TTCCACGCCG CGGTCGATTG TCCGGCGGGA TTTTGTCAAA 
GCCTTGACGA TGGCTGGTCT GGCCGCCATG GCCACAGGCG AACCACGAGT CGTACGGGCC
AATTCCGAAG AGCCGCTGGT CCATCCCAAA CCGACGGCCG ATGCCTGCAT TCTGCTGTGG
ATGGCCGGCG GTATGGCGGC TCCCGACAAC TTTGACCCCA AACGCTACCG CCCTTTCGAA
AAGGGCCTGG CCGTCGCGGA GATGCTCAGC ACGTTCCCCG CCATCAATAC CTCCATTGAT
GGTGTGCAGA TTTGCGAAGG GCTCGAAAAC ATCGCGCAGA TCCTCCATCG CGGGACGCTC
ATTCGCAGTG CTGTCCAACC CGATCTGGGG AGCATTCTCC ATAGCCGCCA TCAGTATCAC
TGGCATACAG GCTATGTCCC TCCCCAGACA GTCGCCTGCC CGCATTTAGG TTCGTGGATG
GCAAAGGTCC TTGGGCCGCG AAATCCCGTC ATGCCCGCGT TCATCAATAT CGGGCAGAGA
CTGGAAGGTG TCGGTGAAAG CGAAGAACTC AAGGCGTTTA CGACAGCCGG ATTCTTTGGC
AGCGAGTTCG GGCCGATGAA CCTTCCGTTC CCGGAAGAAG CGGCTCAATC GGTCAAGCCG
CCCCAGGGCA TGACCAACGC GCGATTCGCC AATCGTGAGC GGCTCTTCCG GCAGTTGATC
GACAAAAATC CCCATCGCGA CCAGCTCAGC GATTATCAGC AGCAATCGAT GCTCCGCAGT
CTCGACAACG CCTACCGTCT GCTCAGTTCC AAAGAGCGGG AAGCGTTTGA TATCACGCTG
GAACCGAAAG AGGTTCAGGA AAAGTACAAT ACCGGTCGCT TTGGCAGAGG TTGTCTCCTT
GCCCGCCGAC TCATTGAGAA TGGCGCACGG TTCGTGGAAG TCACGACCGA GTATGTTCCG
TTCCTGCACT GGGACACGCA TGCAAATGGG CATGAAACCG TGGCCCGCAT GCACCAGGAA
ATTGATCGCC CGATTGCCAC ACTCGTTCAA GAACTCGATG AACGAGGGCT GCTCGATCGC
ACCCTGGTGA TTGTCGCCAG CGAGTTCAGT CGCGACGCCC TCATGGAAGG AAAACCCGGC
TCAAACGCAG GGGATCAAGC CGCCTTCCGC GACGACAACA TCAGTGAAAT GGCCCATTAC
GGCCTGCATC GACATTTCAC TGGCGGCAGC AGCGTCCTGA TGTTTGGCGG TGGCATGAAA
AAAGCCTACG TCCATGGCCA GACGGCTGAT GAACGCCCGC TGATTGCCAT CAAAGATCCC
GTCACCGTCA TGGACTTGCA TGCCACAATC ATGACGGCCA TGGGCATCAG CCCCAAGACG
GCCTATTCCA TTGAAGGCCG CCCATTCTAT GTCACAGAAG ATGGTAAAGG CCTCCCCGTC
CAGCAGCTCT TCACCTAA
 
Protein sequence
MAFEIDPSTP RSIVRRDFVK ALTMAGLAAM ATGEPRVVRA NSEEPLVHPK PTADACILLW 
MAGGMAAPDN FDPKRYRPFE KGLAVAEMLS TFPAINTSID GVQICEGLEN IAQILHRGTL
IRSAVQPDLG SILHSRHQYH WHTGYVPPQT VACPHLGSWM AKVLGPRNPV MPAFINIGQR
LEGVGESEEL KAFTTAGFFG SEFGPMNLPF PEEAAQSVKP PQGMTNARFA NRERLFRQLI
DKNPHRDQLS DYQQQSMLRS LDNAYRLLSS KEREAFDITL EPKEVQEKYN TGRFGRGCLL
ARRLIENGAR FVEVTTEYVP FLHWDTHANG HETVARMHQE IDRPIATLVQ ELDERGLLDR
TLVIVASEFS RDALMEGKPG SNAGDQAAFR DDNISEMAHY GLHRHFTGGS SVLMFGGGMK
KAYVHGQTAD ERPLIAIKDP VTVMDLHATI MTAMGISPKT AYSIEGRPFY VTEDGKGLPV
QQLFT