Gene Sala_2055 details

Gene Information

Locus tag: Sala_2055
Symbol:
ID: 4080122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2164284
End bp: 2165639
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 67%
IMG OID: 638010429
Product: hypothetical protein
Protein accession: YP_617097
Protein GI: 103487536
COG category: [S] Function unknown
COG ID: [COG1322] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
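
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length counts the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2164284, 2165639)
print(length)                  # 1356, matching "Gene Length"
print(protein_length(length))  # 451, matching "Protein Length"
```

Under these assumptions the three fields agree: 1356 bp is 452 codons, one of which is the stop codon.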


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.655381
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.223767
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGGAA TGTCGCTCAT CATCGCCCTT GCGGCCCTTG CCATCGGCGC GTTGATCGGC 
TGGTTGTTCG CCGCTCGCCA GTCGGGGGCG CTGATGGCTG AACGCGACGG GCTGGCCGAG
CGGTTCAGGA GCGCCGTCAC CGACCTCGCC GCCGAGACCG AGGCGCGCCA GGCGGCGGAC
ATCCGGCTCG CGGCGCTGCG CGCCGAGCAA GAGGCGCGCG AGGCCGCGCA CGCGGCGCAG
GTGAGGCAGC TTCAGGACGC GCAGGCGGCG CTGACTGCGC AGTTTCGCGA GGTCGGGCAG
GCGATGCTCG GCGAGGCGCA GAAGGAGTTT CTGGAACGCG CCGAGGCCCG GTTCCGACAG
AGCGAGGAAA GCGCGGGCCA GCATCTGAAG GCGCTGCTTC AGCCCGTCCA CGAACGGCTG
GAGAAATATG AAACCGCTGT GAAGAAGGTC GAAACCGAAC GGCAAAGCGC GTTCGGCATG
TTGCAGGGGC AGATCGAATC GATGCGCGCG CAGAGCGAGC GCGTGTCGAG CGAGGCGGCC
AAGCTCGTCA ACGCGCTCCG CAATGCGCCG AAGGCGCGCG GGCGCTGGGG CGAACAGCAA
CTGCGCAACG TGCTCGAAAG CTGCGGGCTC AGCGAACATG CCGATTTCCA GACCGAGGTC
AGCGTTGCCG ATGGCGACGG CGGGCGGCTG CGTCCCGACG TTGTGGTGAA GGTTCCCGGC
GGACAGAGCC TCGTCATCGA CGCCAAGGTT TCGCTCAACG CCTATCAGGA CGCCTTCGGC
GCGGTCGACG AGGGCGAAAA GGCGGCGCAC CTTGCCGCGC ATGCCGCGGC GATGAAGGCG
CATGTCAACG CGCTGGGCGC CAAGGCCTAT TGGAACCAGT TCGACGACAC CCCCGATTTC
GTCGTGATGT TCGTCCCCGG CGAACATTTC CTCGCCGCCG CGCTCGACCA TGACCACGAG
CTTTGGGACT ATGCGTTCGA GCGCAAGGTG CTGCTCGCGA CGCCGACCAA CCTCATCGCG
ATCGCGCGCA CCGTCGCGGC GGTATGGCGG CAGGAAAAGC TCGCCAACCA GGCGCGCGAA
ATCGCGATGC TCGGCAAGGA ACTTTATGCG CGCATGTCGG TGATGGGCTC GCACATCGCG
CGCGTCGGCA AAAATCTCGA TCAGGCGACG GGCGCTTACA ATGCCTTTGT CGGCAGTTTC
GAATCGCAGG TTTTGACGCA GGCCAAGCGT TTCGAGGCGC TCGACATCGA AACCGGCGGG
CGGGAGATTC CGACGCTGCC GGTTGCCGAA CAGGCGGCGC GCCCGCTGGC GAAGCTCGCC
GCGGCGCCGA GCGCGGTGAA CGACGCGGGC GAATGA
 
Protein sequence
MDGMSLIIAL AALAIGALIG WLFAARQSGA LMAERDGLAE RFRSAVTDLA AETEARQAAD 
IRLAALRAEQ EAREAAHAAQ VRQLQDAQAA LTAQFREVGQ AMLGEAQKEF LERAEARFRQ
SEESAGQHLK ALLQPVHERL EKYETAVKKV ETERQSAFGM LQGQIESMRA QSERVSSEAA
KLVNALRNAP KARGRWGEQQ LRNVLESCGL SEHADFQTEV SVADGDGGRL RPDVVVKVPG
GQSLVIDAKV SLNAYQDAFG AVDEGEKAAH LAAHAAAMKA HVNALGAKAY WNQFDDTPDF
VVMFVPGEHF LAAALDHDHE LWDYAFERKV LLATPTNLIA IARTVAAVWR QEKLANQARE
IAMLGKELYA RMSVMGSHIA RVGKNLDQAT GAYNAFVGSF ESQVLTQAKR FEALDIETGG
REIPTLPVAE QAARPLAKLA AAPSAVNDAG E
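
The sequences above are laid out in blocks of ten characters separated by spaces, so they need cleaning before any analysis. The following is a minimal sketch of removing that layout and computing a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGGATGGAA TGTCGCTCAT CATCGCCCTT GCGGCCCTTG CCATCGGCGC GTTGATCGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases: six blocks of ten
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the "GC content" field of the record.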