Gene Sala_0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0833 
Symbol 
ID4080041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp839060 
End bp840250 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content75% 
IMG OID638009192 
ProductMoeA-likedomain-containing protein 
Protein accessionYP_615884 
Protein GI103486323 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0303] Molybdopterin biosynthesis enzyme 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.117387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00577528 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGCAC TGCTCCCCGT CGAGGAGGCG CAGGCGCGCC TGCTCGCGCT GCGCGCGCCG 
CTGGGACGCG AGACGGTCGC GCTCGCCGAC GCGCATGGCC GCTATCTCGC CGCCGATGTC
CTGGCGGAGC GCGACCAGCC CGCGGCGCCG CTGTCGGCGA TGGACGGCTA TGCGATCCGC
TTCGACGACC TGCCCGGCCC ATGGACGGTG ACCGGCGAGG TTGCGGCAGG CAGCGCTCCC
GACCGCGCCG TCGGCGCGGG CGAGGCGCAG CGCATCTTCA CCGGCGCGGT CGTTCCGCCC
GGCGCCGATA CGGTGATCGT GCAGGAGGAT GTTGCGCGCG AGGGCGACCG GCTGACCCTG
ACCGGCGACG GGCCGGACAG CCGCGGCCGC CACATCCGCG CGCGCGCCGC CGATTTCGCC
GCCGGCGACG CGCTGCTCGC GGCGGGTTCG CGATTGACAC CCGGCGCCAT CGCCACCGCC
GCGATGAGCG GCGCGGGCGC GCTTTCCGTC GCTATCCGCC CGCGCGTCGC GATCCTCACC
ACCGGCGACG AACTGGTCGC GCCGGGTCGC GCGCCCGGCC CCGGTCAGAT CCCCGACAGC
AACGGCGTGA TGCTCGCCGC CATGCTCGCG GGCGAGGCCG CCGCGCCCGT GCAGCCGCGC
CACATCCGCG ACGACCGCGC GACGCTCGCA AAGATTCTGA AGGAGCTGGC GCGGAGTCAC
GACGTCATCG TCACCGTCGG CGGCGCGTCG GTCGGCGATC ACGACCATGT CCGCGGCGCG
CTGGGTGACG CGGGCGGGCG GCTCGATTTC TGGAGGATCG CGATGAAGCC CGGCAAGCCG
CTGATCGCCG GCACGCTCGG CGACGCGATC CTGCTCGGCC TGCCCGGCAA TCCCTCGTCG
GCCTTCGTCA CCGCGACGCT CTTCCTCCTC CCGCTCGTCC GCCACCTCGC GGGTGCGCGC
GCGCCGTTGC CGCCGGTACA GCGCGCGCCG CTCGCCGCAC CGCTCGACGC CGGCGGCACG
CGCCGCGACT ATCTGCGCGC GCGGGTCGAG CGGGGCGTGC TGACCCCGCT CGTCGGACAG
GAAAGCGGCC GCACCCTCCC CCTCGCCGCC GCCAATGCGC TGCTCATCCG CGACATCGGC
GCCCCCGCGC GCGATGCCGG CGACGCGGCG GACTATATCG CCATCGCTTG A
 
Protein sequence
MSALLPVEEA QARLLALRAP LGRETVALAD AHGRYLAADV LAERDQPAAP LSAMDGYAIR 
FDDLPGPWTV TGEVAAGSAP DRAVGAGEAQ RIFTGAVVPP GADTVIVQED VAREGDRLTL
TGDGPDSRGR HIRARAADFA AGDALLAAGS RLTPGAIATA AMSGAGALSV AIRPRVAILT
TGDELVAPGR APGPGQIPDS NGVMLAAMLA GEAAAPVQPR HIRDDRATLA KILKELARSH
DVIVTVGGAS VGDHDHVRGA LGDAGGRLDF WRIAMKPGKP LIAGTLGDAI LLGLPGNPSS
AFVTATLFLL PLVRHLAGAR APLPPVQRAP LAAPLDAGGT RRDYLRARVE RGVLTPLVGQ
ESGRTLPLAA ANALLIRDIG APARDAGDAA DYIAIA