Gene Saro_1186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1186 
Symbol 
ID3916483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1228490 
End bp1230394 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content69% 
IMG OID640443922 
ProductTonB-dependent receptor 
Protein accessionYP_496465 
Protein GI87199208 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.874956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGT ACCTGCTTTC CGTTTCCGCC GTCGCGTTCG CGTGCCCGGC GCTTGCGCAG 
GCCGCCAACG ATGTCGTCGT GGCCGACCGC CGTGCCGATC CCGCCGCAAT TACCGTTATC
GCTACCGGTA GCGAGACCCT GGTCAGCCGA GTCGGCCAGC CGGTCACCGT CATCGCCGCC
GACGAGATCC GGTCGATTCA GGGCCCTGAC ATCGCCCGCG TCCTCGAGCG CGTCCCCGGC
CTCGCGCTCA CCCGCAATGG CGGTCCCGGA AGCTTTACCG GCGTGCGCCT GCGCGGCTCT
GATGCCGAAC AGGTACTTGT CCTCGTCGAC GGCGTCCGCG TCGAGGATGT CTCGGCCCCC
TCCGGCGGCT TCGATTTTGG CACGCTCACC CCCGGCGGGA TCGAGCGCAT CGACGTGCTG
CGCGGCTCCA ACTCGATCGT CTGGGGCAGC GCCGCGATCG GCGGCGTGAT CGCCGTCCAG
TCGCGCGAAC TCGATGGCAT CGAGGCCAGC GCCGAACTCG GCAATCACGA TACCTACACC
GCTGACGCCG CCGCCGGCCT TTCTTCCGAC CGCGCCGCGC TTACCCTGAA CGGCGGCTAC
AGCAGCAGCG ACGGCGTTTC GGCCGCTGCT TCGGGTACCG AACCCGACGG CTTCCGCCAG
TGGCGCGTCG GCGGCCGGGG CCGCGTGAAC CTCACCGACG AGATCGCCGT TCTCGCCTCG
GCCCGCTATG CCGACACGCG CACCGACATC GACGGCTTCG GTCCGCCGAC CTACGTGGAA
TTCGGCGATA CGCCTGAATA CCAGACGACC CGCCAGGCCT CGGGCCGCGT CGGTCTGCGC
TACACGGGCG CCGTGCTTAC CCTCAACACT GGCTTCGCCC TGTCTGACAC CAAGCGCGAC
TACTACGACC CGACCTACTC CGCCGACCCC TCCTACGGCT ACAAGGGCCG CTCCGAGCGC
GTCGACCTGA CCGGCCGCCT CAACCTCCCC GCCGATTTCA CGCTCGATTT CGGCGGCGAC
AGCGAATGGA CGCGCTATTC CGGCACCTAC GACGCGCAGC AGAAGGCCCG GCTCACCAGC
GGCCACGCCC TGCTCGGCTG GAGCAGCGCG CGCGCCAGCC TCGCTGCGGG CGTGCGCGTC
GACGATCACA GCCGCTTCGG CACCGCGTGG ACCTTCGGCG CCAACGGTAC GCTTGAATTG
GTGGACAGCC TGCGCCTGCG CGCCTCCTAC GGCGAAGGCT TCAAGGCCCC CACGCTCTTC
CAGCTCCTGT CCGACTACGG CAACGCGGAT CTCAGGGCGG AACGCAGCAA GTCCTACGAC
GCGGGTCTCG AATGGGGCAG CCGCACCGGC AAGCTCCACG CCGCCGTCAC GGTCTTCCGC
CGCGACAGCC GCAACCTGAT CACCTTCGTC TCCTGCGCCA GCCTCGATGC CTGCGCCACG
CGTCCCTATG GTCTCTACGA CAACGTCGGC CTGGCCCGCG CGCAGGGCGT CGAAGCCGAA
CTCGGCGCCC GCCCGGTCGA TACGCTCCGC CTCCAGGCCG CCTACACCTA CCTCGAAACC
GAGAACCGCA CCGCCGGCAC CGCCAACTTC GGCAATGACC TGGCGCGCCG TCCCGCCCAT
GCGCTGACCC TGTCGAGCGA CTGGGCCCCT GTTGACTCCG GCCCGCTCGC CGGCTTCACC
CTCGGCGCTG ACCTCCGCCT CGTGGGAGAC AGCTATGACA ACGCGTCGAA CACCCGCCGC
CTCGATGGCT ATGCGCTGAC CACCGTGCGC GCCAGCTTCC CCCTTACCGA GAACGTCGAA
CTCTACGGCC GCGTCGAGAA CCTTTTCGAC GTCGCCTACC AGACCGTGGC CGACTACGGC
ACCTGGGGCC GCTCTGCCTT CCTCGGCATT CGCGCCCGCT ACTGA
 
Protein sequence
MKKYLLSVSA VAFACPALAQ AANDVVVADR RADPAAITVI ATGSETLVSR VGQPVTVIAA 
DEIRSIQGPD IARVLERVPG LALTRNGGPG SFTGVRLRGS DAEQVLVLVD GVRVEDVSAP
SGGFDFGTLT PGGIERIDVL RGSNSIVWGS AAIGGVIAVQ SRELDGIEAS AELGNHDTYT
ADAAAGLSSD RAALTLNGGY SSSDGVSAAA SGTEPDGFRQ WRVGGRGRVN LTDEIAVLAS
ARYADTRTDI DGFGPPTYVE FGDTPEYQTT RQASGRVGLR YTGAVLTLNT GFALSDTKRD
YYDPTYSADP SYGYKGRSER VDLTGRLNLP ADFTLDFGGD SEWTRYSGTY DAQQKARLTS
GHALLGWSSA RASLAAGVRV DDHSRFGTAW TFGANGTLEL VDSLRLRASY GEGFKAPTLF
QLLSDYGNAD LRAERSKSYD AGLEWGSRTG KLHAAVTVFR RDSRNLITFV SCASLDACAT
RPYGLYDNVG LARAQGVEAE LGARPVDTLR LQAAYTYLET ENRTAGTANF GNDLARRPAH
ALTLSSDWAP VDSGPLAGFT LGADLRLVGD SYDNASNTRR LDGYALTTVR ASFPLTENVE
LYGRVENLFD VAYQTVADYG TWGRSAFLGI RARY