Gene Saro_0523 details


Gene Information

Locus tag: Saro_0523
Symbol:
ID: 3918653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 564837
End bp: 567248
Gene Length: 2412 bp
Protein Length: 803 aa
Translation table: 11
GC content: 61%
IMG OID: 640443253
Product: TonB-dependent receptor
Protein accession: YP_495804
Protein GI: 87198547
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
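
The start position, end position, gene length, and protein length listed above can be cross-checked with a short calculation. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 564837
end_bp = 567248

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 2412 bp and protein length of 803 aa, with no leftover bases.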


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAAAT CGGATCATCG GTGTGCCGTG CGCAAGAGCG GAGTGGGCTA TTTCGGCGGC 
GCAAGCCTTG TGGCACTCGC TTCGTGCTTC CTGCTGCCGT CGGTTGCTGC CGCACAGGCA
GAGTCGGCTG AGCCGCAGGA GGAGATCGCC ACCCAGGATA TCGTCGTGAC GGCCCAACGC
CGCGAAGAGC GGCTGTCGAA GGTTCCGGTC TCGGTCGTCG CGTTCGGGGC CGAAGCGCTG
CAGAACCGCA ACATCACTAG CGAACAGGAC ATCGGAACGC TCGTTCCAGG CCTTCAGGTG
AAGAACGGCC AGAACTCCAA CCAGCTTAGC TACAGCATGC GAGGCCAGTC GCTGGACCCG
TTCTCGGGCA CCAGCCCTGC CGTGCTGCCC TATCTGAATG AAGCGCCGTA CAACCCCGGC
AACACGGCCA CGGCCTTCTT CGATCTTGGT TCGATCCAGG TACTCAAGGG GCCTCAGGGC
ACGCTGTTCG GTCGCAATGC GACCGGCGGC GCCGTGCTTT ACACCACGCC GATGCCTGGA
GACACTTTCG GTGGATATGT GACCGTGCGC GGTGCATCGC GCGATTACGG GCAGATGCAG
GCGGCGGTCG ACCTGCCCAT CGCCGAAGGC AAGGCAGCCC TTCGACTGGC CTTCGACGCG
ACGCGCGGCA ATGGCTATGT CACCAATATC AACACCGGCA ATACGTTGGG CGACAAGAAC
AGCCGTTCGG GGCGCGTCAC CCTTTTGCTT ACCCCGACCG ACACGCTCAG GAACGTCACC
ATCGTCCAGT ACGATCGCGT GAAGGGCACC GAAGGCGTCG GCGGCATCTA CACCTACTAT
TCCGCGACCG ACCCGCAGTT TGTCAGCGAC GGCACTAACC ATGTCCCGGT CAGCGGTCTG
ACCAACACTC TCGCCGCGAT ATACGGCACC AATGATGGGC CGGCCGCGCC TGGCTTCTGG
CCGGGCGCCG TCGAGGGCTA TACCAAATTC ACCCGGGCAA ATCCCTACAA GGTCTGGTTG
CAGTACGATC TGCCGCACAG CGCGGAAAAC GTGTTCGTCT CGAACACCAC GGAACTCGAG
ATTGGCGCGG ACACCAAGCT GAAGAACATC TTCAGCTACA TGAACGGAAA GTCCAACACT
CCGGGTAATC TTGGTGGCGG ACCGTTCGGC TCATTGTGGC TGTTCAAGCT TGCGGGCATC
AACGCCACTG GTGCTCCCGG CGGCCAGACG TTCGAGTCCA ATACCTTCAG CGAGGAGCTT
CAGATTCAGG GAAGCTTGAT GGAGGATCGC CTCAACTATA CCGCGGGCGT GTTCTACTCG
AACCAGAAGC GCTTCGAGAT CGTCCCGATC AACATCGGCG CGGACGTGGT CCCCGGCGGC
ATCGCAGACA TCTCGTATGC CTATCGGAAC CGGCAGGAAT CGAAGGCGAT CTTCGCGCAG
GTCAGCTACA AGGTCACCGA GCAGCTCACG GCGACGCTTG GCGGTCGCTA CACCTGGGAG
AACGTCGGCA TCCGGCAGGC CCCCGGCAAC GTGTTTGGCG TCGATCCCGA CTCGCCTGCC
GCGGACCAGA GCAGGAAGCT TTCCGCTCCG GCCTGGACTG CCAGCCTGCA ATACCAGATC
GATCCCAACA ACATGGTGTA CTTCAACCAG CGCGGCAGCT TCCGTTCGGG CAACCTGAAC
GGCACGGTTG CGCCGTTCAC CGACCCATTG ACCGGCCAGC CGGCGAACTT CTTCAAGAAC
GAGAAGGTCC ACGACTTCGA ACTGGGCTAC AAGTTCAACG GCCGCATTGG CGGCGCTCCG
GTCCAGTTCA ATGTCGCCGC GTACAAGGTG ATCGTGAAGG ACGCCCAGCG TGCCCTTTAC
GCCCTCGTGG GTGGTGCCCC GGCAGGTTTC ACGGTCAACG TTCCTCAGGC CGAGACCCAG
GGTTTCGAAG TCGACGCCTT CGCGGGTCTG ACGTCGTGGC TCGATGTCGG GTTCAACCTT
GCCTACACCG ACGCGAAGTA CACCAAGCGC AGCGTGCCGA TCCCGTTCGT AGGCAATCTG
CTGGTCGACT CCTATCCTGA TGCTCCGAAG TGGGCAGGGT CGGCCAACAT CGAGATCAAA
TTCCCCTTGC CCGAGGAAAT CGGCAAGATC AGCCTTCGCG GCGATTACTA TGGGCAGACG
GGCTTCTTCT TCTCGAACAC CAATGGGACG TCGACGCCCG GAACACATCT CGATGGGTAC
TGGAACGTCG GTGCAAGGCT GAACTGGAAG GAAATCATGG GAAGCCAGGT TTCTGCCGCT
GTCTTCGTGA GGAACCTTAC CAACGAATCT TACTACATTT CCGGCTATGC TCTGGGCGCG
TCCAATGGCG TGAACACCGC ATATCCGGGC GAGCCTCGCA CCATTGGGGC AGAAATCTCC
GTCAAGTTCT AA
 
Protein sequence
MTKSDHRCAV RKSGVGYFGG ASLVALASCF LLPSVAAAQA ESAEPQEEIA TQDIVVTAQR 
REERLSKVPV SVVAFGAEAL QNRNITSEQD IGTLVPGLQV KNGQNSNQLS YSMRGQSLDP
FSGTSPAVLP YLNEAPYNPG NTATAFFDLG SIQVLKGPQG TLFGRNATGG AVLYTTPMPG
DTFGGYVTVR GASRDYGQMQ AAVDLPIAEG KAALRLAFDA TRGNGYVTNI NTGNTLGDKN
SRSGRVTLLL TPTDTLRNVT IVQYDRVKGT EGVGGIYTYY SATDPQFVSD GTNHVPVSGL
TNTLAAIYGT NDGPAAPGFW PGAVEGYTKF TRANPYKVWL QYDLPHSAEN VFVSNTTELE
IGADTKLKNI FSYMNGKSNT PGNLGGGPFG SLWLFKLAGI NATGAPGGQT FESNTFSEEL
QIQGSLMEDR LNYTAGVFYS NQKRFEIVPI NIGADVVPGG IADISYAYRN RQESKAIFAQ
VSYKVTEQLT ATLGGRYTWE NVGIRQAPGN VFGVDPDSPA ADQSRKLSAP AWTASLQYQI
DPNNMVYFNQ RGSFRSGNLN GTVAPFTDPL TGQPANFFKN EKVHDFELGY KFNGRIGGAP
VQFNVAAYKV IVKDAQRALY ALVGGAPAGF TVNVPQAETQ GFEVDAFAGL TSWLDVGFNL
AYTDAKYTKR SVPIPFVGNL LVDSYPDAPK WAGSANIEIK FPLPEEIGKI SLRGDYYGQT
GFFFSNTNGT STPGTHLDGY WNVGARLNWK EIMGSQVSAA VFVRNLTNES YYISGYALGA
SNGVNTAYPG EPRTIGAEIS VKF
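
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a small script. This is a sketch, not the method used to produce the record: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It assumes the sequences are pasted in the 10-character blocked layout shown above. The sense-codon assignments of translation table 11 match the standard genetic code. Table 11 also permits alternative start codons, which this sketch does not handle; that is not needed here because the gene starts with ATG.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks of the blocked layout."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGACCAAAT CGGATCATCG GTGTGCCGTG"
print(translate(first_blocks))  # MTKSDHRCAV
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be compared in the same way.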