Gene Sala_1853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_1853 
Symbol 
ID4082031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp1947293 
End bp1948447 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content65% 
IMG OID638010228 
Producthypothetical protein 
Protein accessionYP_616898 
Protein GI103487337 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.575321 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAGG GAGGGGGCTG CATGACCACG CAACGCCACT ACGGCATGGA CTGGCTGCGC 
ATCGGCGCCT TCGCGCTGCT GATCCTCTAT CATATCGGCA TGTATTTCGT GCCGTGGGAT
TGGCATGTGA AGATCGATCC GACGATCGAT TGGGTTGCGT TGCCGATGTA CGCGACGAAT
GGCTGGCGGC TTCCATTGCT GTTTCTGGTG TCGGGTTATG CGAATGCTGC GTTGCTGGTG
AAGCTGGACG GCGCCGGGGC GTTCGCGCGG TCGCGCAGCG CGCGGTTGCT GATCCCGCTG
GCGTTCGGGA TCATCGTGGT CATCCCGCCG CAACCATGGG TCGAGCTGGT CGGCCAGCAT
GGCTATCCGC ACGGCTTTCT GCGCTTCTGG ACGTCGGACT ATTTCCGCTT CGGCACGCTG
GGCGGGATCG TGCTGCCGAC CTGGCAGCAT CTGTGGTTCG TTGTCTATCT GTGGACCTAT
ACGATGCTCG CCGCGCTGCT GCTCGCGGCG GTGCCCGCGG CGATGCGGGG GTGGATCGCC
GATGGCGCGG CGCGGCTGCT GTCGGGCTGG CGGCTGCTGA TCGTGCCGAT GCTGTGGTGG
CTCGCGGTTT ACGGCGCCTT TCCTGAGCAT GACGAGACGC ATGCGCTGTT CGACGACGGG
CCGGCGCATC TCCGCTATCT TATGGCTTTC GGCGCGGGCT GGCTGCTGCG CGTGCGGCCC
GCCCTGTTCG CGGCCGTCGC CCGCTGCTGG AAGGTCGCCG CGCTGCTTGG CCTGCTGGCG
TTCCTGCCGA TCATGTGGGT CGAATCGACT TGGTCCGGCG ACATGCGCGC GCCCGACTGG
GCGATCGCGC TATTCCATGT CGCGCGGCGG GTGCAGGGCT GGGTAGCGAT CGTCGCGCTC
ATCGGCGTCG CCGACCGGTA CTGGAACCGC GACCATCCAA AACGCGCGAT GTTCGCCGAG
GCGGTCTTCC CCTTTTACAT CATTCACCAG ACGATCATCG TCGTCGCCGG ATGGTATCTG
CTGCGAGCGG GCGTGGCGGC GTTGCCATCC TTCCTGATCC TGCTCGCGGC GACGATGCTG
GGATGCTGGC TTTTCTATGC GATTGGGCGC AGCATCGGCT GGCTGCGGCC GCTGATCGGG
CTCCAACGGC GATAG
 
Protein sequence
MMKGGGCMTT QRHYGMDWLR IGAFALLILY HIGMYFVPWD WHVKIDPTID WVALPMYATN 
GWRLPLLFLV SGYANAALLV KLDGAGAFAR SRSARLLIPL AFGIIVVIPP QPWVELVGQH
GYPHGFLRFW TSDYFRFGTL GGIVLPTWQH LWFVVYLWTY TMLAALLLAA VPAAMRGWIA
DGAARLLSGW RLLIVPMLWW LAVYGAFPEH DETHALFDDG PAHLRYLMAF GAGWLLRVRP
ALFAAVARCW KVAALLGLLA FLPIMWVEST WSGDMRAPDW AIALFHVARR VQGWVAIVAL
IGVADRYWNR DHPKRAMFAE AVFPFYIIHQ TIIVVAGWYL LRAGVAALPS FLILLAATML
GCWLFYAIGR SIGWLRPLIG LQRR