Gene Plim_3387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3387 
Symbol 
ID9140103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4381434 
End bp4382645 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content56% 
IMG OID 
Producttype II secretion system protein E 
Protein accessionYP_003631399 
Protein GI296123621 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.729797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATTT CTCTCCATCC ATTAACCAAA AACGATTTGA AGATTGATGG CTCGTTGGCT 
CGATGTCAGT TGCCGTGGGA GAACCTGAAT CTGCAGGACG CTGACTTTGC TGTGCGGGCC
GTCGAACAAT TGCTTCGCCT GGCGATTGAT GAGCGTGCGA GTGATGTGCA CCTGGTTCCC
TCCCCTGCAG GATTGCAGAT TGCCTTCCGG CAAGATGGTT TGCTGGAGCC GATGGGGAGC
TTTCCACCTG AGGTTTCGCC ACTGATCATC AATCGCATCA AAGTTCTGGC TCAACTGCTG
ACTTATCGGA CGGATCTGCC TCAGGAAGGC CGCCTCAGGC TTCCCGAGTT TCCGGGGGAA
TTGCGGGCGA GTACTTTCCC GACCATTCAT GGCGAAAAGG TAGTGGTGCG CCTCTTTATT
GGTTCCGGGC AGTATCGAGT GCTGGATGAG CTGGGCTATA CACCTCAGGT GCAACTCGGA
TTGGAGCAGG CATTATTGCA GACCAGCGGG ATGCTGCTGT TGACCGGGCC AGCCGGGAGT
GGAAAAACGA CGAGCGCTTA TGCCTGTTTG AGGTGGCTGC AAGCCTATCG GAAAGGGCAA
TGCAGTCTGG TCTCGCTGGA GGATCCCGTC GAAGCGTTTC TTCCGGGAGT TTCGCAGACA
CAGGTTCGCA GGGGCACGGA ATTCAATTAT GCCCTCGGAT TAAGGTCACT TTTACGGCAA
GATCCCGATG TGATTTTTGT GGGGGAAATT CGCGATGCCG AGACGGCACA GACGGCCTTT
CAGGCCTCAC TGGCGGGTCA TCTGGTGATT TCCACGTTTC ATGCGGGCTC AGCCGGAGAT
GCGATTTGCC GGCTGACAGA TCTGGGAGTT GAACCTTTTC TCCTCCGCAC CGGCTTGATT
GCCGTGCTGT GCCAGCGACT CGTCAAACGA CTGGTCAAAG ATCGAGAGAC TCCCTTATCA
ACCCGGCGAT ATGAGGGGCG TTTTGTCGCT GCGGAACTGA TGGAGCCGGA ATTGCATCAC
CTGGCTCGAC CCATTATGCG CAAAGTGAAT GGCAAGCGTC TTGAGGAACT GGCAGCCCGG
CATGGTTTCG TTCCCTTACG GGAGGCGCTC GAAGAGGCTG TCCGTACCGG GAAGACCGAC
TTGCCCGAAG TTTATCGAAT TCTCGGGACT CGACCAGTGG AAGAACGCCG GGATGTTGCT
GAGGAATTAT AG
 
Protein sequence
MSISLHPLTK NDLKIDGSLA RCQLPWENLN LQDADFAVRA VEQLLRLAID ERASDVHLVP 
SPAGLQIAFR QDGLLEPMGS FPPEVSPLII NRIKVLAQLL TYRTDLPQEG RLRLPEFPGE
LRASTFPTIH GEKVVVRLFI GSGQYRVLDE LGYTPQVQLG LEQALLQTSG MLLLTGPAGS
GKTTSAYACL RWLQAYRKGQ CSLVSLEDPV EAFLPGVSQT QVRRGTEFNY ALGLRSLLRQ
DPDVIFVGEI RDAETAQTAF QASLAGHLVI STFHAGSAGD AICRLTDLGV EPFLLRTGLI
AVLCQRLVKR LVKDRETPLS TRRYEGRFVA AELMEPELHH LARPIMRKVN GKRLEELAAR
HGFVPLREAL EEAVRTGKTD LPEVYRILGT RPVEERRDVA EEL