Gene Sala_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2004 
Symbol 
ID4082169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2113125 
End bp2114435 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID638010380 
Producthypothetical protein 
Protein accessionYP_617048 
Protein GI103487487 
COG category[S] Function unknown 
COG ID[COG5323] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0120622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCACG CAACCGACTG CGTCGGCGAA ACGCGGACGA AACTCGATCG CTGGTTGAAG 
GCGCTGTCGG ACAGGAAATG CGCGCAGCTG CTGACCGACT GGTCGTGGTG GCGGCGCGCG
GACCAGAATC CGCCGACGGG CGACTGGCAT GTGTGGCTGC TGCTCGCGGG GCGCGGGTTC
GGCAAAACGC GCACCGGCGC CGAGTGGGTG CGCGCCTTTG CGGAGACGAC GCCGGGTGCG
CGGATCGCGC TGGTCGCGGC GTCGTTGCTG GAGGCGCGGC AGGTGATGGT CGAGGGCGAA
AGCGGGTTAT TGGCGATTGC ACCCGACCAT CTGCGCCCCG AATATGAAAG CAGCCTGCGG
CGGCTGACGT GGCCGAACGG CGCGGTGGCA ACGCTCTATT CGGCTGTCGA GCCCGACAGT
CTGCGCGGTC CTGAGCATGA TGCGGCGTGG TGCGACGAGA TCGCAAAATG GCCGAAGGGC
GAGGCCGCAT GGGATAATCT GATGCTGACA ATGCGGATTG GCGCGCGTCC ACAGGTCGTC
GCGACGACGA CGCCGCGCTG CGTGCCGCTG GTACGGCGAC TGATACAGGA AAGGGGGGTT
GCGACGACGC GCGGGCGCAC GGCGAGCAAC CGGCGCAATT TGTCGGTTCA ATGGCTGGCG
ACGATGGATG CCATCTATGG CGGGACGCGG CTGGGGCGGC AGGAGCTGGA CGGCGAATTG
CTGGAGGATG TCGAGGACGC GCTGTGGACG CGCGCGCTGA TCGAGCGGTG CCGCGTCGAT
GCGGGGAGCA TCGGCAAATT CGCGCGCGTC GTGATCGGCG TCGATCCGCC GGCGAGCGCG
GGGGGCGATG CGTGCGGGAT CGTGGTGGCG GCGCTGCTGC GCGACGGGCG GCTGGCGGTG
GTCGAGGATG CGAGCGCCCT ACGCCCGCTG CCGGGCGTGT GGGCGCAGGC GGTGGCCGCC
GCGGCGGCGC GCTGGGGCGC CGAGCGCGTG GTGGCCGAGA GCAATATGGG CGGCGACATG
GTCGCGGCGG TGCTGCGCCA GGCCGACATG ACGCTGCCCG TCGTGGCGAT TCATGCGAGC
GTCGGCAAGG CGCGGCGCGC GGAGCCGGTG GCGCTGGCCT ATGAGCGCGG GCAGGTGGTC
CATGCGGGGG CGTTTGCCGA CCTGGAGGAC CAGCTTTGCG GATTGCAGAT GGGCGGCGGC
TATGCGGGGC CGGGGCGCTC GCCCGACCGG GCGGATGCGT GCGTGTGGGC GCTGGCGGCG
TTGCTGGACG GGATGCGCAA GGGGCGCGGG CCGGGGGTGC GGGTGGTTTA G
 
Protein sequence
MKHATDCVGE TRTKLDRWLK ALSDRKCAQL LTDWSWWRRA DQNPPTGDWH VWLLLAGRGF 
GKTRTGAEWV RAFAETTPGA RIALVAASLL EARQVMVEGE SGLLAIAPDH LRPEYESSLR
RLTWPNGAVA TLYSAVEPDS LRGPEHDAAW CDEIAKWPKG EAAWDNLMLT MRIGARPQVV
ATTTPRCVPL VRRLIQERGV ATTRGRTASN RRNLSVQWLA TMDAIYGGTR LGRQELDGEL
LEDVEDALWT RALIERCRVD AGSIGKFARV VIGVDPPASA GGDACGIVVA ALLRDGRLAV
VEDASALRPL PGVWAQAVAA AAARWGAERV VAESNMGGDM VAAVLRQADM TLPVVAIHAS
VGKARRAEPV ALAYERGQVV HAGAFADLED QLCGLQMGGG YAGPGRSPDR ADACVWALAA
LLDGMRKGRG PGVRVV