Gene Plim_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_4039 
Symbol 
ID9140759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5181622 
End bp5182737 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content53% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003632049 
Protein GI296124271 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.970162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCACG TTGATCTTCC CTCTCGAGAT GATATTGCCA CCCTGATGAA TTCACGGGGT 
GAGATCAAGG TCAGCATTTA TCTTCCCACG ACGCCATTTT CTCAGCAGGC CCAGCAGGAT
CGGATTGTCC TCAAAAATCT GACAAAAACT GCGATTGATC AACTGGCAGA GCGTCCGAAA
AAAGATGTCG AAGCCATTGA AGGACTGCTT CTTGATCTGG TAGATGATGG ATCGTTCTGG
GAATACCAGG CACATGGCCT CGCTATTTTT GTGACACCCA CACAGATTCA TACTTTAAGA
CTTCCCTACA GCGTTCAAGA GCTGGTGGAA GTCAGTGATC GATTCCACGT CAAGCCATTG
CTGCATCCAA TGGCGGCTTC TTCAACAGGC TTTGTCCTGG TGCTTGGTCA GAATAATGTC
AAGCTGCTGC AGATCTGCTC CGACCTGCCC GCAGTGACAT TGAACATCGA CGGCTTGCCG
AAGGATGCAG CAAGTTCTGT TGGCAAGTCG TCCATTCAGG ATCGCTCACC CAGCGGACGA
ATTCAGGGAG ACGAAGGGAA GAAAGTTCGT CTCACACAAT ATGCACGCCT GGTTGATCAG
GCACTCCGGC CGGTTCTCAA TGGACGCAGT GAACCTCTGA TTCTGGCAGC AACAGAACCC
TTGCTTTCCA TTTACCGTCA GTTGGCAACG TATCCATTTC TTGCTGCCGA AGAAATTCGC
CACAGCCCGG ATGCGATTTC CGATGTCGAG ATTGTGGCTG CCGCCCGCAC CATCTTCCAG
AACCTCGCCA GTGCCCGCAT TCAATCGGCA CTCGAGACCT TCGAGCAGCG AAAAACACAA
AATCGCACAA CTACTAATCT CGAAGAAATC TCCGTCGCTG CCACACAAGG CGCAGTGCAA
TCCTTAATTG TCGATGTCGC CCGAGTCACC CCCGGGACCA TCGATGAGCA CGGCAAAATT
ACGCCAGGTG CCGCCAATTG CCCGGTCAGT TACGATGTCG TCGGTGAAAT CTGTGCTCGC
GTCATGATGA CCGGAGGGAC TGTCCTCGCA GCCGGAGGCG AACAGGTTCC CGGTACATGT
GGGCTGGCAG CTACTCTTCG TTATGCTCCA CAGTAG
 
Protein sequence
MLHVDLPSRD DIATLMNSRG EIKVSIYLPT TPFSQQAQQD RIVLKNLTKT AIDQLAERPK 
KDVEAIEGLL LDLVDDGSFW EYQAHGLAIF VTPTQIHTLR LPYSVQELVE VSDRFHVKPL
LHPMAASSTG FVLVLGQNNV KLLQICSDLP AVTLNIDGLP KDAASSVGKS SIQDRSPSGR
IQGDEGKKVR LTQYARLVDQ ALRPVLNGRS EPLILAATEP LLSIYRQLAT YPFLAAEEIR
HSPDAISDVE IVAAARTIFQ NLASARIQSA LETFEQRKTQ NRTTTNLEEI SVAATQGAVQ
SLIVDVARVT PGTIDEHGKI TPGAANCPVS YDVVGEICAR VMMTGGTVLA AGGEQVPGTC
GLAATLRYAP Q