Gene Sala_1831 details


Gene Information

Locus tag: Sala_1831
Symbol:
ID: 4082183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1925390
End bp: 1926598
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 67%
IMG OID: 638010206
Product: HI0933-like protein
Protein accession: YP_616876
Protein GI: 103487315
COG category: [R] General function prediction only
COG ID: [COG2081] Predicted flavoproteins
TIGRFAM ID: [TIGR00275] flavoprotein, HI0933 family
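
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1925390
END_BP = 1926598
GENE_LENGTH_BP = 1209
PROTEIN_LENGTH_AA = 402


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # 1926598 - 1925390 + 1 = 1209 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 1209 / 3 = 403 codons, minus one stop codon = 402 aa
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both checks agree with the values listed in the record.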


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000611752
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.363311
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCTGGC GATCGCAAAG CTGCACTGAC GCCTGCGTGG CAGTATTCGA TTATGACGCC 
ATCGTGCTGG GTGCCGGCGC GGCCGGGCTG ATGTGCGCCG CGGTCGCGGG GCAGCGGGGG
CGGCGCGTCC TGCTGCTCGA TCACGCCGAC GAGGTGGGCA AGAAAATCCT GATTTCGGGC
GGCGGTCGCT GCAATTTCAC CAACATCTAC ACGCGCCCCG AAACCTATAT TTCGGCGAAC
CCGCATTTCG CGAAGTCGGC GCTCGCGCGC TACGCCCCCG CCGACTTCAT CGCGCTCGTC
GAGCGCCATG GCATCGCATA TCATGAAAAG ACGCTCGGCC AGTTGTTCTG CGATGGATCG
GCGAAGCAGG TCGTGGCGAT GCTGCTCGAC GAATGCGCAA GGGGCGGCGT CGACGTCCGC
TGCGGGCAGC CGGTGCGTCA GGTTACCCAC GCCGACGGTC GGTTCGCTGT CCGCTTCGGC
GACCTGGACT TCGCCGCCCC CAATCTTGTC ATTGCGACAG GCGGGCCTTC GATCCCGAAG
ATGGGCGCGT CCGGTTTCGC TTATGATCTC GCCCGCCAAT TCGGCCTCAA GGTCGTCGAG
CCGCGCCCTG CGCTCGTCCC GCTGACGCTT GGCGGCGACG ATGTGCTGTT CCGCGAGTTG
TCGGGCGTTG CGACGCCGGT CGAGGCGCGC GCGGGCAAGG CGGCGTTTCG CGAAGCCGCG
CTCTTCACGC ACAAGGGGCT TTCCGGTCCG GCAATCCTTC AGGTCAGCAG CTATTGGCGC
CACGGCGAAC CGGTGACGAT CGACTTTTTG CCCGATGCCG CGCCGGGCTG GCTGCTCGGA
GCGAAGCGCG CGCGCCCGCG CGCGACGCTC GCTTCGGCGC TCGCGCTCCC CGACCGCCTT
GCGCAGACGC TCGCCGACCG CCTTGCGCTC CCCGGCGAAC TCGGCGCCCA GACCGACCGC
AAGCTCGCCG ACGCCGAAGC GCGCCTCAAA CGCTGGACTT TCCGCCCCAA CGGCACCGAA
GGCTTCGCAA AGGCCGAGGT CACCGCCGGC GGCATTTCGA CCGCAAACCT GTCCTCGCAA
ACAATGATGG CCAAATGCGT GCCGGGACTG TATGCGGTCG GCGAAGCCGT GGACGTCACC
GGGTGGCTGG GCGGCTATAA TTTTCAATGG GCCTGGGCCA GCGGACACGC GGCGGGCCAG
GCCCTTTAG
 
Protein sequence
MGWRSQSCTD ACVAVFDYDA IVLGAGAAGL MCAAVAGQRG RRVLLLDHAD EVGKKILISG 
GGRCNFTNIY TRPETYISAN PHFAKSALAR YAPADFIALV ERHGIAYHEK TLGQLFCDGS
AKQVVAMLLD ECARGGVDVR CGQPVRQVTH ADGRFAVRFG DLDFAAPNLV IATGGPSIPK
MGASGFAYDL ARQFGLKVVE PRPALVPLTL GGDDVLFREL SGVATPVEAR AGKAAFREAA
LFTHKGLSGP AILQVSSYWR HGEPVTIDFL PDAAPGWLLG AKRARPRATL ASALALPDRL
AQTLADRLAL PGELGAQTDR KLADAEARLK RWTFRPNGTE GFAKAEVTAG GISTANLSSQ
TMMAKCVPGL YAVGEAVDVT GWLGGYNFQW AWASGHAAGQ AL
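
The GC content field in this record can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as text in the 10-base blocks shown above. The helper names are hypothetical, and the example uses only the first line of the gene sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence from this record, for illustration only.
    first_line = (
        "ATGGGCTGGC GATCGCAAAG CTGCACTGAC "
        "GCCTGCGTGG CAGTATTCGA TTATGACGCC"
    )
    print(len(clean_sequence(first_line)))
    print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of the first line gives a value to compare against the GC content field.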