Gene Sala_0450 details


Gene Information

Locus tag: Sala_0450
Symbol:
ID: 4080939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 460587
End bp: 462050
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 66%
IMG OID: 638008808
Product: TonB-dependent receptor, plug
Protein accession: YP_615504
Protein GI: 103485943
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
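
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, so treat both as assumptions. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(460587, 462050)
print(length)                     # 1464
print(protein_length_aa(length))  # 487
```

Both values agree with the Gene Length and Protein Length fields of this record.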


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.144048
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.622728
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCAGG ATATTCGCAG GCGCCGGATC GTTCCGCTGA TGGGAGCGCT GTCGGTGCTG 
ATCGCACCCG AACTGGCGTG GGCGCAGGAG GGCGCGGCCT CCGCGACGGG TGGCCTCGAC
GAAATCGTCG TCACGGCGCG CAAGCGCGAG GAGAATCTCC AGTCGGCGCC GCTGTCGGTC
GCGGCTTTCA GCGGCGATAC GCTGGCCAGG GCCGGGATCG ACGAGTTCGC GGAGATAGCG
ACCCGGGTAC CGGGCTTCAC ACTCAATCCG GACAATGTCT CCGAACCCAA TATCTTCCTT
CGCGGGATCG GAACCGACAT CGAAAGCGCC GCGTCGAGCG CCGCGATCGG TTTTTTCCTG
AACGATGTCT ATTTGCCGCG CGCGTCGGGC ACCGCGATCG AACTGTTCGA TCTGGAGCGC
GTCGAGATTG TGCGCGGCCC GCAGGGCACG CTCTATGGCA AGAATGTCGT TGGCGGCGCG
ATCAACTTCA TCACCCGAAA GCCGACCGAC GCATTTCGTG CGGGGGTCGA GGCGGGGATC
GGCAACTATG GCTCGTTCGA CGTGAAAGCG ACGATCGCGG GCGGCATCGG CGAAGGGTTG
TCGGGCAGCC TGGCCGCGGC CGCGCGGCGC CGCGACGGTT TTGCCTTCAA CAGCTTCACG
GGCAATGACG TCGAGGATCT GTCGGCCTTC GGCCTGATGG GCCAGTTGCG CTATCAGCCC
GGCGACAGCC TCGACATTCT GCTTACGGGC GATCTGACGC GGCGGCGGGC GCGGGGCAAA
TGGGTCGACA TCCAGACGCC GTCGACGCAC AATATCCCCT TCGTGAACCC CGATCCGCGG
CGCGGGCCGA ATAATGTCGA CGGTCGGCAG GACGCCGATC TTGGCGGCAT CCATCTGAGC
GCGAACTGGG ACAGCGGCGC CGGCACACTG ACCCTGATCA GCGCCTACCG CGAGGGCGAT
TTCAGCGTGC TCAACAATGA TGCGGGCAGC TTCATTGATT TTACGCGCCT GGTCTATGAC
GGCAATGGCC GGATCGATTT CCTTGCGATC GACCGCAGCC GGTTCAACGA CGATTATTTC
ATCAACGACA AGGACGCGAT CCACCTGCTC GCCCAGCAAG CCCCACGAGC CCAGGTCCGG
TTCGGCCGCC ACGGCGACCA GCAGCCACTC ACGCTCGGGC GGAAGCCCCG GCTTGTGGTG
CCCGCCCAGC TTGCCCGGCG CGACGCCGCC CGTCCGGCGC TGCCGCTGGA CCGGCGCATC
CGCGCTGGCG ACGCTCCCCC CGACCTGCGG TGCCGTCCCG CGCGCCGACG CGCCCAATGC
CCGCGCGGCA ACCGCCCTTG CCGACGATCA ATCGAATTGG CTCGGCTGGT CCCTGTTGGC
CTCCCGCGCA GCCAACAGGT CGAATCGGAA AATCAACGCT GTCTGAATCC CCAATCGATT
CGCTCAAACA AGAGGGCGCG CTAG
 
Protein sequence
MRQDIRRRRI VPLMGALSVL IAPELAWAQE GAASATGGLD EIVVTARKRE ENLQSAPLSV 
AAFSGDTLAR AGIDEFAEIA TRVPGFTLNP DNVSEPNIFL RGIGTDIESA ASSAAIGFFL
NDVYLPRASG TAIELFDLER VEIVRGPQGT LYGKNVVGGA INFITRKPTD AFRAGVEAGI
GNYGSFDVKA TIAGGIGEGL SGSLAAAARR RDGFAFNSFT GNDVEDLSAF GLMGQLRYQP
GDSLDILLTG DLTRRRARGK WVDIQTPSTH NIPFVNPDPR RGPNNVDGRQ DADLGGIHLS
ANWDSGAGTL TLISAYREGD FSVLNNDAGS FIDFTRLVYD GNGRIDFLAI DRSRFNDDYF
INDKDAIHLL AQQAPRAQVR FGRHGDQQPL TLGRKPRLVV PAQLARRDAA RPALPLDRRI
RAGDAPPDLR CRPARRRAQC PRGNRPCRRS IELARLVPVG LPRSQQVESE NQRCLNPQSI
RSNKRAR
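
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: the names `CODON_TABLE_11` and `translate_cds` are hypothetical, and the table is built from the standard amino-acid assignments, which translation table 11 shares (table 11 differs only in which codons may act as starts, which this sketch does not handle). Whitespace in the input is ignored so the sequence can be pasted in the 10-base groups used above.

```python
from itertools import product

# Amino acids in NCBI codon order (bases T, C, A, G at each position).
# Translation table 11 uses the same assignments as the standard code.
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_11 = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE_11[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record
first_line = "ATGAGGCAGG ATATTCGCAG GCGCCGGATC GTTCCGCTGA TGGGAGCGCT GTCGGTGCTG"
print(translate_cds(first_line))  # MRQDIRRRRIVPLMGALSVL
```

The output matches the first 20 residues of the protein sequence listed above; passing the full gene sequence should reproduce the whole protein under the same assumptions.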