Gene Plim_2161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2161 
Symbol 
ID9138865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2784823 
End bp2786523 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content53% 
IMG OID 
ProductStage II sporulation E family protein 
Protein accessionYP_003630186 
Protein GI296122408 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0242635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAT TGCTCCTGCT TCAAGGAGGG GAAGCCACCC CTTATGAAAT CGCTGGTGGC 
GAAGTGGTGC TGGGGCGACA TCCTGAATGC TCGATCCAGA TCAACTCGAA CATGGTTTCC
CGCAAGCATG CTCGAGTCTT TTCTCACGAT GATGGCTTCG TCATTGAGGA TCTCGGGAGT
GGTAACGGCA CGTTCCTGAA TGGCAAAAAA CTCGAAGCAG CGACAAAAAT CAAAGATGGC
GACCGCATTA AGCTGGGTCC CATTCTCCTG CGATTTGAAG ATCCGGATAA TCCTGCCAGT
CGCCCCGCGC TGGCTCCAGG AAGTGGAGTC AGTGGAGCCG GAAACCAGGC AGCTACAGCG
ACTTTCAATC TCGAGTTTGC CTCTGGCGAT GACGATGTCG CCACTGTGAT GGGAACATCC
GGGCGTGTCG AAGGGTTTGG CGCCCTGGAA GTTCAACCCG AAGCCAAACT GAAGGCCGTG
CTCGAAATCA GCCGTGCCTT AGCAGGCAGC ACTGATCTTG ATGGATTGCT CCCCAAGATA
CTCGACACGC TGTTCAATAT CTTTCCACAT GCGGATCGTG GCGTTGTTCT CTTCAAAGAA
GACGATGGCA AACTCATTCC GCGAGCGATT AAACATCGTC GCTCAGACGA AGATGAATCG
GTGAAATTGA GCCGGACAGT TCTTAACACT GTGCTCGAGC AGAAAACAGG GATTCTTTCG
GCAGACGCAA CGAACGATTC TCGTTTTGAA GCCAGCGAAT CAATCTCGGC TCTCACCATC
CGCTCGATGA TGGCTGTCCC CATGCTGAGC GTCGCCGGTG ATGTTCTGGG TGTGATTCAT
ATCGATACTC AGAATGCCTT CAACCAGTTT AAAAAAGATG ACCTCGATCT GTTGATCGCG
GTTGCTGGTC AGGCGGGTCT TTCTTATGAA ACCGCTCGAC TCATGGTGAC GGCTCTGGAA
AAACAGAAGC AGGACCGTGA AATGCAGATT GCCGCCAATG TGCAACTGGC CCTGCTGCCG
GAAAGTCTTC CCAAAGTCGA TGGTTACCAG TTCTACGCCT CTTACGATTC GGCACAGGCA
GTAGGTGGCG ATTACTACGA CTGCATGCAA CTCGAAGGTG ATCGCGTCTT CTTTGCTTTT
GGCGATGTGG CAGGGAAGGG TGTGCCTGCT TCACTGGTCA TGTCCCGAAT TTCCAGCGTC
GTGCAGAACG TGATGGCCTT CGTGACAGAC GTTGGCGTCG CTGTCGGACG AATTAATAAT
CAGATGTGTG CAAAAGCTGT CGAAGGCCGG TTTGTGACCT TCGTCCTGGG CGTCATTCAT
ACGCAAACGG GCGAAATGTC TCTCGTAAAT GCCGGCCACA TGCCCATCAT GATCCGCAAG
GCCGATGGGA CAGTCGAAGA ATTCGGAGCC GAAGCCGTCG GTATTCCTTT AGGGGTCATG
GAAGATTACC CCTTCGATGT GGTCACGAGA CAAATTGCAC CAGGCGAGAC ATGCCTGATC
TACACCGATG GTGTCAGTGA GGCCATGAAT CACAACAGTG ATCTTTACGG CATTGAACGG
ATTCGCGAAC TGATGCATGC CCACGGGCAT GAAGGGGCCG AAGAGTTAGG ACGGACGATC
CTGCAGGATG TCCGCCGCCA TGCCAATGGT CGTCCTCAGA ACGACGACAT CACGCTGATG
CTCTTCAGCC GTCTTGGCTG A
 
Protein sequence
MAKLLLLQGG EATPYEIAGG EVVLGRHPEC SIQINSNMVS RKHARVFSHD DGFVIEDLGS 
GNGTFLNGKK LEAATKIKDG DRIKLGPILL RFEDPDNPAS RPALAPGSGV SGAGNQAATA
TFNLEFASGD DDVATVMGTS GRVEGFGALE VQPEAKLKAV LEISRALAGS TDLDGLLPKI
LDTLFNIFPH ADRGVVLFKE DDGKLIPRAI KHRRSDEDES VKLSRTVLNT VLEQKTGILS
ADATNDSRFE ASESISALTI RSMMAVPMLS VAGDVLGVIH IDTQNAFNQF KKDDLDLLIA
VAGQAGLSYE TARLMVTALE KQKQDREMQI AANVQLALLP ESLPKVDGYQ FYASYDSAQA
VGGDYYDCMQ LEGDRVFFAF GDVAGKGVPA SLVMSRISSV VQNVMAFVTD VGVAVGRINN
QMCAKAVEGR FVTFVLGVIH TQTGEMSLVN AGHMPIMIRK ADGTVEEFGA EAVGIPLGVM
EDYPFDVVTR QIAPGETCLI YTDGVSEAMN HNSDLYGIER IRELMHAHGH EGAEELGRTI
LQDVRRHANG RPQNDDITLM LFSRLG