Gene Plim_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2066 
Symbol 
ID9138769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2678616 
End bp2679782 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content56% 
IMG OID 
ProductMammalian cell entry related domain protein 
Protein accessionYP_003630092 
Protein GI296122314 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTGAGC GTCAGTTGCA GTTTCGCGTT GGAATGATGG TGCTGGTGGC CATGGCCATC 
GGTGTCGGTC TGCTCGTTCG CGCGGGAAAA CTGGATTCCT ATTGGGATGA AGATTTCAGT
ATCGCCATCC AGTTTGAATC GGCCGGGGGG ATTTATCCCA GTGCACCCGT CCGACTTTAC
GGACTGACGA TTGGAAATGT TCGCGATGTC CGCCTGGATA ACAAACGTCG AGGCGTGATT
GTCATTGCCG AAATCGACGC CAAGCACAAA CTTCCGATTG ATTCCACTGC CCAGGTGGCC
GTGAGCCTTT TGGGCGAGGG GCATCTGGAA ATCATTCCTG GCCTGTCGGA AGAACCACTC
AAACATGGCG CAGTGATCAG TGGTCAAGCT GCTGGTGATC CCATGGCTTT GGTCGCTCGA
CTTGAGGCCA AGACCACCGC CACGATGGAT TCCTTTGCCG CCACGAGCAA GGAATGGGGC
ACACTCGCCC ATAACGTCAA CAATCTTCTC GAAACCAAAC GCGGGAACAT CGATCAGGTC
ATCGAGCGGG CCGCCGACTC GTTGGATCAA CTTTCACTGG CCATGAAATC CGCCACTGAG
TTAATCCAGC AGGCCAATCG CATTGTTGGT GATCCCAAAA CTCAAGCAGC ACTCCAGCAG
ACCGCTCAAT CACTCCCTCG TCTGGTTAAT GATACTCGAG AGACCATTGT CGTAGCCCGC
ACGACGCTCG AAAGCATGCA GCAGAATCTG AAAAACCTCG AATCGGTCAC CGATCCACTG
GCCAAAAAAG GGAACGATAT GATTGTCCGG CTCGACACCA GCCTGGCCAA TCTGGATCGT
CTTCTGGCCG ATGCCAGTCG GTTTGTCCGG ACCCTGAATA CTCAGGATGG CACACTGCAG
AAACTGGCGG CTGATCCCCA GCTCTACGAC AACCTGAACC GTTCGGCCCA ACTGGTGACA
GTCCTCCTGC GCGGCATTGA ACCGATCGTT CAGGACATGC GGGAGTTCAG TGATAAAGTC
GCTCGCCGCC CCGAGATTCT CGGCGTTGGT GGAGCCATTC AACCCAGCAA CGGCCTGCGC
GATACCGAAC TGATCGAGCA AAGTGGCGGA ACAGCCCCCA AAACCCAGCA GAAATCGGTA
CGACCGAGTT TCCTGCCGGG AAGATAA
 
Protein sequence
MSERQLQFRV GMMVLVAMAI GVGLLVRAGK LDSYWDEDFS IAIQFESAGG IYPSAPVRLY 
GLTIGNVRDV RLDNKRRGVI VIAEIDAKHK LPIDSTAQVA VSLLGEGHLE IIPGLSEEPL
KHGAVISGQA AGDPMALVAR LEAKTTATMD SFAATSKEWG TLAHNVNNLL ETKRGNIDQV
IERAADSLDQ LSLAMKSATE LIQQANRIVG DPKTQAALQQ TAQSLPRLVN DTRETIVVAR
TTLESMQQNL KNLESVTDPL AKKGNDMIVR LDTSLANLDR LLADASRFVR TLNTQDGTLQ
KLAADPQLYD NLNRSAQLVT VLLRGIEPIV QDMREFSDKV ARRPEILGVG GAIQPSNGLR
DTELIEQSGG TAPKTQQKSV RPSFLPGR