Gene Plim_3720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3720 
Symbol 
ID9140438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4783172 
End bp4784464 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content57% 
IMG OID 
Productprotein of unknown function DUF1501 
Protein accessionYP_003631731 
Protein GI296123953 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCCAG TCATTCACGA TCTTGTTCAG CTTTCGAGAA ACCACTTTAC CCGCCGACGT 
CTGCTGCAAT CTACTGCGGC TGGGATGGGT GCGATGGCTG CCGGTGGTTT GCTGCCAGCT
TGTCTGTCGG CTCAGGCGGC TGATCTGCAG CAGAAGAAGC GTTCGATCAT TGTCCTCTGG
ATGCAGGGAG CACCCAGCCA GTTCGAGACC TTCGATCCCA AGCCTGGTAC GGAAACAGGC
GGCCCTACGA AGTCCATCTC GACAGCCGCC CCGGGAATTC AGATTGCCTC GACCTTCCCG
CAAGTTGCGA AAATGATGAA CGAGATTGCC CTGATCCGTT CACTCACGAA TAAAGAAGGG
AATCATCAGC GGGCGACTTA CCAGTTGCAT ACAGGCTACA TTCCCACCGG TTCGGTCAAG
CATCCTTCAC TGGGGGCGAA TATCTCCCGA CAGATTGCTC CTGCCGGGCA GGATCTGCCG
TCGCTGGTCA CCATCGGGAA TGCGATAGCC GGGATTGGTG CTGGGTATCT GGGAATCAAC
TACGAGCCGC TGCACCTCAA TCAGGCCGGT AAGATTCCCG ATAACGTCAC GATTGGAACG
AGTACCGAAC GCTTTGACCG ACGGCTGGGT CTACTCGGCC AGATGGATCA GCAGTTTGCC
GAACGTGGTG GAGCCTCTGT CGTGCAGACA CATCGCGATC TCTACTCAAA GGCATCAGGG
ATGGCTCAGT CGAAGGATCT GAAGGTCTTC GACCTCGATG AAGAACCAGC CGCTCTCAAG
GAGGCTTATG GCGATACCAA CTTTGGGCGT GGTTGCCTTC TGGCGCGCCG TCTTGTTGAA
GCAGGTGTCA CTTATATCGA AGTGCGCGTG GGGAACTGGG ATACCCATGC CGATAACTTT
GATGCGACGA CCCGGCTGGC TGGGGAAGTT GATCCGGCGG CAGCCACTTT GATTCGAGAC
CTCAAAGACC GTGGCCTGCT CGATTCGACA CTCGTGGTCT GGATGGGTGA GTTTGGCCGC
ACTCCCAAAA TCAATGCCCG CACAGGTCGC GATCACTTCC CGAAAGCATT TAACGGCTTC
CTCGCGGGAG CCGGTATTCG CGGTGGTCAG GTGATTGGAC GCACCAACGC CGAGGGGACA
GAGATCGAAG ACCGACCAGT GACTGTGGGT GATCTCTTCA CATCGATCTG TGCGGCTATG
AAGGTCAATC CCAAGGATGA AACCATGAGC CCCCAGGGCC GACCTCTCAA GGTCATTGAA
TCGGGCGAAG TGATTCAAGG ACTCTTCGCC TGA
 
Protein sequence
MSPVIHDLVQ LSRNHFTRRR LLQSTAAGMG AMAAGGLLPA CLSAQAADLQ QKKRSIIVLW 
MQGAPSQFET FDPKPGTETG GPTKSISTAA PGIQIASTFP QVAKMMNEIA LIRSLTNKEG
NHQRATYQLH TGYIPTGSVK HPSLGANISR QIAPAGQDLP SLVTIGNAIA GIGAGYLGIN
YEPLHLNQAG KIPDNVTIGT STERFDRRLG LLGQMDQQFA ERGGASVVQT HRDLYSKASG
MAQSKDLKVF DLDEEPAALK EAYGDTNFGR GCLLARRLVE AGVTYIEVRV GNWDTHADNF
DATTRLAGEV DPAAATLIRD LKDRGLLDST LVVWMGEFGR TPKINARTGR DHFPKAFNGF
LAGAGIRGGQ VIGRTNAEGT EIEDRPVTVG DLFTSICAAM KVNPKDETMS PQGRPLKVIE
SGEVIQGLFA