Gene Plim_3931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3931 
Symbol 
ID9140649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp5050060 
End bp5051283 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content57% 
IMG OID 
ProductSulfite oxidase 
Protein accessionYP_003631941 
Protein GI296124163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCACA TCCCTCACCC CTCCAGCCGC CGTCACTTTA TGGGTCAATC GGCAGGGCTT 
CTGGCTGCAA TGGGTCTGTG GAATACCAAC TTCGGGCAAC TCGAACGCCC ACTGCGTGCC
GACGAACCGG CTCTCGACCT GATCAGTGCT GATATCAAGC GGCTCATTAC CAGGCAGGCC
ACCCCTTATA ATGCCGAGCC ACCCGCTCCA GAACTGACTT CGCAATGGCT CACACCCACC
CGCAGCCTGT ATGTTCGCAG CCATGGGAAG ATTCCCCGGA TTGACGAAAA GAGTTTCCGG
CTCAGCATCA GTGGTCTTGT CGAGCGTCCC GTCACGTTCA CTTTGCCGGA ACTTCAAGAA
CGCTTCCCCG CCACTTCGAC TCTCGCCACC ATGGTTTGTG CCGGTAATCG GCGCAATGAA
TTCGCACTCG GCGGACGCAA AGCGCCGGGT GTCCCCTGGG ATGTCGGTGC GATTGGAACG
TGTGACTGGC AAGGCGTTTC ACTCGCTCAA GTTCTTAAGC ATTGCGGCGT CAAAGCCGAG
GCCAAGCATA TCTGGTTTGA AGGACTGGAT GCCGTCGAAG GCCACGGAGA TCCCTTTCCT
TTCGGGGCCT CCATTCCCCT CGAAAAGGCA ATGCAGGATC AGGCGGGAGA AGAAACTTTG
CTCGCCTGGG GGATGAATGG TGACCCCCTG CTTCCCGAGC ATGGCTTCCC GCTGCGTAAT
GTCGTCCCAG GCTACATTGG TGCCCGCAGT GTGAAATGGC TGGCCAAAAT TGTTGTGAGT
GACAAACCCT CGCCCAATTT CTTTGTGCAG GATGTGTACA AGGTTGTTTA CGACGACAAA
CCCGAATCAA CAGCGAATGC CGCTCCGATT ATGGAGTTTA TCCTGAACTC GGCCATCACC
GAAATTCGCC GCTTCGATAA CAACCTGCGC ATTCGCGGCT TCGCACTCGC TCAAGGGAGC
CTGGGCAACG CGATTGAACG CGTTGAAATC AGCACCGATC GCGGCAAGAG CTGGCAGCCG
GCCCGGATCA TCACCGCTCC CGGGAAAAAC ACCTGGTCAT TATGGTCAGC GACAATCCCG
GCTGAGGGAG TGAAGACAGT CACTGTCCGC GCCTTTGATC GTGCGGGAAA CTCACAGCCC
GAATCGCAGA AGTTCAATAT TAAAGGCTAT CAACTCAACT CCTGGCACAC ACTCCCCGTA
GCAGGTGCCG GTCAGGGCGC GTAA
 
Protein sequence
MLHIPHPSSR RHFMGQSAGL LAAMGLWNTN FGQLERPLRA DEPALDLISA DIKRLITRQA 
TPYNAEPPAP ELTSQWLTPT RSLYVRSHGK IPRIDEKSFR LSISGLVERP VTFTLPELQE
RFPATSTLAT MVCAGNRRNE FALGGRKAPG VPWDVGAIGT CDWQGVSLAQ VLKHCGVKAE
AKHIWFEGLD AVEGHGDPFP FGASIPLEKA MQDQAGEETL LAWGMNGDPL LPEHGFPLRN
VVPGYIGARS VKWLAKIVVS DKPSPNFFVQ DVYKVVYDDK PESTANAAPI MEFILNSAIT
EIRRFDNNLR IRGFALAQGS LGNAIERVEI STDRGKSWQP ARIITAPGKN TWSLWSATIP
AEGVKTVTVR AFDRAGNSQP ESQKFNIKGY QLNSWHTLPV AGAGQGA