Gene Sala_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_0447 
Symbol 
ID4082995 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp457723 
End bp458871 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content71% 
IMG OID638008805 
Productkelch 
Protein accessionYP_615501 
Protein GI103485940 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.724357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.84708 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACGGA CCGTCTCGAT CACCCTGGCT CTGGTCATCG GACTTGCCGG CTGCGCGGCG 
AACGGGGAGG ATGCGGCCGG TCTGCAAGCC GCCGACCTTC GCTTCGAGGA CGGGCCGCGC
CTCGGCGCGC CGCGGGCTGC CCATCAACTG ATCGCGACCG CGGACGGCAA GCTGCTGGCC
ATTGGCGGAT GTGTGCGCTC GGGGTGCGAC GTCGGCCCCG CCAGCGCAAC CGTCGACATC
ATCGATGCGG CAAGCATGGC GCTGATCGGC AGTGGCCGCC TGCTCGCCGC GCATGTCCAG
CCATCGGCCG TCGCGCTGCG CGACGGCCGG GTGCTGATCA CGGGCGGCTG GATCGACGGG
CGCCCGGCCA CGGCGATCGA GATATTCAAC CCCGCAACGG GCAGGTCGGT CGCGGGACCG
GCGCTCGGCG GACCCCGCGC CAATCCCGCC GTCGTGGGGC TTGCCGACGG ACGCGTCCTG
ATCGCCGGTG GTTATGACGG CCAGGATGCG CTCGGCGATG CGCTGATCTT CGATCCCGCC
AGTGGAACGC TGTCGGCGAC GGGCAGGCTG GTCACGCCGC GCGCCGGGGC CAGCGCCACC
CTGCTGTCCG ATGGCCGGGT GCTGTTGGTT GGTGGCGGCC GTGCCGAACG GAGCCCCCGG
ATCGCGCTCG CGAGCGCGGA AATCTTCGAT CCGGCCACGG GGCGATTCGA GGCGGCCGGG
TCGCTGGCCC AGGGACGCTA CAAGCATGGC GCGCTCCGGC TCGACAATGG CGACGTGCTG
ATCGTCGGCG GCGCCACTGA ACGCGATTCC GCCGGGAAAC TGCGTTCGGT CGAACGGTTC
GACGCGGCCA CGGGCCGCTT CGTGGTTGCG GGGCAATTGC TCGCCGGACG CTACAAGCTG
GCCGATGCCC TGCTGCTACT GCCGGGCAAC CGCGTGCTCG TGGCGGCGGA CGACATGGCG
CCCGAGATTT TCGATGTCGC GCGCGGCCGG AGCAGCCGGG TCGATTACGA TCTGGGCGAG
CGCTGGAACT TCATGGCGAT GGTCCGTGTC GATTCGCGGC GGGCCCTGCT CGCCGGCGGC
TACAGCGAAA AGGGGATCGA CCCGACCGAT CGAAGCTGGG TCATCCATCT GCCCACGGGG
GCGTCGTGA
 
Protein sequence
MIRTVSITLA LVIGLAGCAA NGEDAAGLQA ADLRFEDGPR LGAPRAAHQL IATADGKLLA 
IGGCVRSGCD VGPASATVDI IDAASMALIG SGRLLAAHVQ PSAVALRDGR VLITGGWIDG
RPATAIEIFN PATGRSVAGP ALGGPRANPA VVGLADGRVL IAGGYDGQDA LGDALIFDPA
SGTLSATGRL VTPRAGASAT LLSDGRVLLV GGGRAERSPR IALASAEIFD PATGRFEAAG
SLAQGRYKHG ALRLDNGDVL IVGGATERDS AGKLRSVERF DAATGRFVVA GQLLAGRYKL
ADALLLLPGN RVLVAADDMA PEIFDVARGR SSRVDYDLGE RWNFMAMVRV DSRRALLAGG
YSEKGIDPTD RSWVIHLPTG AS