Gene Plim_1928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_1928 
Symbol 
ID9138630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2516040 
End bp2517170 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content52% 
IMG OID 
Productbiotin synthase 
Protein accessionYP_003629957 
Protein GI296122179 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000208086 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGCAA CTGGGTCTTC GAAAGAAAAC TGCTCCACAA AGATTTGCTG GAGTGATCTT 
GCTGACAAAG TGATGGGCGG GCACGTGCTG ACCCGTGAGG AAGCTCTAGC CATTCTGGAT
TCAGAAGACG ATGAGATCGT GGATCTGCTG GCAGCTGCTT ACAAAGTGCG GCGTAAGTAT
TTCGGGAATA AAGTTCAGCT TTACTTCCTG AAAAATGCCA AGAGTGGTCT CTGCCCAGAG
GATTGTGGTT ACTGCTCTCA ATCCAAGATT GCCGAAACTG AAATTCCTAA GTATGCCATG
CTGAATGAAG CCAAGCTCAT GGAAGGAGCA GCGCGAGCTG TCGAAGCCAA GGCCCGAACC
TATTGCATCG TGGCTTCAGG ACGCGGCCCT TCCAACCGAG AAGTAGGGCA TGTCGCCAGC
GTTGTCAAGA AAATCAAAGA GACCTATGGA CTGCATATCT GCTGCTGCCT GGGTCTGTTA
TCGCCCGATC AGGCCAAAAC ATTAGCAGAA GCCGGGGTTG ATCGGATCAA CCATAACCTG
AACACAGGTC GCGAGTTTTA CGACAAGATC TGCACGACTC ATACCTATGA TGACAGGCTG
GAAACACTGA AGGTGGTTCG TGAAGCCGGT ATGGAGCTAT GCAGTGGCCT GATTGTGGGC
ATGGGTGAAA CCCAGAACGA TCTGGTTGAT GTCGCTTTTG AATTGCGGGA ACTGGGTGTG
GAATCGACCC CGGTCAATTT TCTGCATGCC ATCGATGGTA CTCCTCTCGA AGCTCGGCAG
GAATTGAATC CCCGCCAGTG CTTGAGAGCT TTGTGTCTGT TCCGCTTTGC CAATCCGGCT
GTGGAACTGA GAGTTTCCGG CGGACGTGAA GTGAATCTGA GGTCGATGCA GGCGATGAGC
CTGTATGCTG CCAACAGTAT GTTCGTCAGC GACTATCTCA CGACTAAAGG GCAGCCGGCT
GAAGATGATT TCAAGATGGT AGCCGACCTG GGGATGGAAG TCGTGATCGG TGATCATGAC
TCTTTTCTCG CATGGAAGGC AGTTCAGGAA AGTCAGCCCC AAACCAATTG CTGCGAGGGA
ACTTCGACCT GTGTAACCCC TGAGAAAACA GCGGCAGGTT GTCATGCCTA G
 
Protein sequence
MSATGSSKEN CSTKICWSDL ADKVMGGHVL TREEALAILD SEDDEIVDLL AAAYKVRRKY 
FGNKVQLYFL KNAKSGLCPE DCGYCSQSKI AETEIPKYAM LNEAKLMEGA ARAVEAKART
YCIVASGRGP SNREVGHVAS VVKKIKETYG LHICCCLGLL SPDQAKTLAE AGVDRINHNL
NTGREFYDKI CTTHTYDDRL ETLKVVREAG MELCSGLIVG MGETQNDLVD VAFELRELGV
ESTPVNFLHA IDGTPLEARQ ELNPRQCLRA LCLFRFANPA VELRVSGGRE VNLRSMQAMS
LYAANSMFVS DYLTTKGQPA EDDFKMVADL GMEVVIGDHD SFLAWKAVQE SQPQTNCCEG
TSTCVTPEKT AAGCHA