Gene Saro_2601 details

Gene Information

Locus tag: Saro_2601
Symbol:
ID: 3917016
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2809891
End bp: 2812260
Gene Length: 2370 bp
Protein Length: 789 aa
Translation table: 11
GC content: 64%
IMG OID: 640445360
Product: TonB-dependent receptor
Protein accession: YP_497871
Protein GI: 87200614
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
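
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2809891
END_BP = 2812260


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2370
print(protein_length(length_bp))  # 789
```

Both results agree with the Gene Length (2370 bp) and Protein Length (789 aa) fields.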


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.889408
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAGAATT GCGTGAAGTT CGCGTGCTCC GCTTCGTTGC TGGTCCTGGC GATGCAGTCT 
TCCGCCGTCG CGGCGCAGGA CGTGCAAGGC AATGCCGGCG TTGCTGAAGA GACGACCCCG
GTCTTCGGCG ACATCGTGGT GACGGCCAAC AAGCGCCAGG AAAACGCCCA GAAGGTTCCG
ATCGCGATCA CCGCCTATTC GGGCGATCAG CTCAAGGCGC TGGGCGTTAC CGATGCGACG
CAGATCACCC AGCAGGTGCC AGGGCTCCAG CTCAACGCCT GGTCGCCGAA CGTCACGATC
TTCAACTTGC GCGGCATTTC GCAGAACAAC TTCACCGACT ACCTCGAGGC GCCGATCGCG
GTCTATGTCG ACGATGCCTA TATGGGGTCG ATCAACGGGC TTTCGGGGCA ATTGTTCGAC
GTGCAGCGCG TCGAGGTGCT GCGCGGGCCG CAGGGGACGC TTTTCGGTCG CAATGCAACC
GGCGGCCTGA TCCATTACCT GTCGACCGAC GCCAGCAAGG CGGAGTTCAA CGGCTACCTC
ACGGCGAGCT ACGAGCGCTT CGACCGGCGC GCGCTCGAAG GCGCGGTCGG TGGGGCGCTG
GCGGACGGCA TTCGCGCGCG CGTTGCCGGG CGCGTGGTCA AGGCCGACGG CTACATCAAG
TCGGCCGCGG CATTGCCGGG CGTGTTCGAG GCCAACGGGC AGGATCTGGG CAGCGAGAAC
GGCTGGGCCC TGCGCGGCAC GATCCAGGCC GATCTCGGCC CCGACGGCAA GCTTGACCTG
TGGGTCAAGC ACAGCGAGGA CAACGACGTC GCGACCGGCG GTTATGTCTT CGACAACTGC
AACCTGCAGG ACAACGGTTA CTGCGGCACC GACGCGGCGG GGCTGGGCAA TGGCAGCGGC
GGGGTCATCA ACGGCATCAC CGGCGAACCC GCCAGCCCCT TCCAGAACTT CAGCGACACG
CCCGGTGTGT TCAACCGCAA CACCAACATC TACCAGGGCA AGCTGACCTA CGACCTGGGC
GGGGTGAACC TGACCGCGAT CACCAACTAC ACCGACCTCA GGAAGGATTA CCAGGAAGAC
GGTGACGCCC TGCCGGTGGA AGTCATCGTG TTCCGCACGA ATGCCCGCTA CCGGCAGTTC
AGCCAGGAAC TGCGCCTGGC GGGCGAAAGC GAGCGCTTCC GTTGGCAGGC GGGCGCCTAC
TACCTCGACA TGAAGATCAA GGGCGGCATG GACACGGTGG GCGCGCCGGC CATCGGCGCC
GCGCTTGCGG CGGGCCTGCC CGGCGTCGCG CCGACGATTG CCGAGACCTA CAATCTGCAT
TCGAAGAACT GGTCGGTGTT CGGCCAGGCG GAATACGACC TGTCCGACAA GCTCACCGTG
ATCGGCGGCT TGCGCTATTC CAAGGATACC AAGACGGTCG ACTACCGGTC GGCAGTGGTG
GAGGGCGCCG CCTCGTCGCT GATCGCCACG GACGAGACGT TTTCGGCCAC GCTGCCCGGC
GCGGACCGCA TTTCGGACGG CGACATCGCC GCGCGCGTCA CGCTGAACTA CAAGCCCGCG
GACGATACGC TGGTGTTTGC TTCGTGGAAC CGCGGCATCA AGGGCGGCAA CTTCACGCTG
AACGGTTATG TCACCGCGCA AACCTTCCAG CATCGTCCGG AAACGCTCAA TTCGTTCGAG
GCGGGCGTGA AGTGGTCGAA CCCCTCGCGT ACGCTGCGCG TCAATGCCAC GGCCTATCAC
TACATCTACA ACGACTATCA GGCTTTTGCG CTGATCGGCG GCGTGCCGCA GGTCGGCAAT
AGCGACGCCA ACGCGACGGG CTTCGAGCTG GAAACCTTCT TCCAGCCGAC CGACCACCTG
AATATCAACC TGGGCGCGAC GTGGGAGCGT ACCCATGTCG ATACCGTCCA GACCGCCGGA
TCGCAGTTCC TGTCGGTTCT GGTGCCGGGA GCGTCGGTGC CCCAGTATTG CACCGACCAG
AACGACGGCA CCTATTTCTG CGACTACCCG ACCAAGTCGG TCAGCGGCGC GCAGTTCCCC
AACGCGCCGA AGTTGAGCCT GAACTATGTC CTGCGCTACA ACGTCGATGC CTTTGGCGGC
AATGTCGTCG CGCAGGTCGA CGGCGTCTGG TACGACAAGC AGTTCCTCGA AGTCACCAAT
GGCCGGTCCT CGATCCAGCC GGCCTACAAC GTGACCAACG CCTCGCTGAG CTGGACGTCC
GACGACGATC GCCTTTCGGT GCAGGTGTTC GGCCGGAACG TCTTCGACAA GGCCTATCGC
GCCTATGCGC TCAACCTCGG ACCGCTCGGC ACGACCTCGG TCTACGCCAA GCCCGCCACC
TATGGCGTCA GCGCCACGGT CAAGTGGTAG
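
The gene sequence above is printed in blocks of 10 bases, 60 per line. A minimal sketch, using hypothetical helper names, of how such a listing might be joined into one string and its GC content computed, so that the result can be compared with the Gene Length and GC content fields of this record:

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Illustrative input: the first line of the listing only, not the whole gene.
first_line = "ATGAAGAATT GCGTGAAGTT CGCGTGCTCC GCTTCGTTGC TGGTCCTGGC GATGCAGTCT"
seq = clean_sequence(first_line)
print(len(seq))                  # 60
print(round(gc_percent(seq), 1))
```

Passing the full listing instead of the first line gives values to check against the 2370 bp and 64% reported above.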
 
Protein sequence
MKNCVKFACS ASLLVLAMQS SAVAAQDVQG NAGVAEETTP VFGDIVVTAN KRQENAQKVP 
IAITAYSGDQ LKALGVTDAT QITQQVPGLQ LNAWSPNVTI FNLRGISQNN FTDYLEAPIA
VYVDDAYMGS INGLSGQLFD VQRVEVLRGP QGTLFGRNAT GGLIHYLSTD ASKAEFNGYL
TASYERFDRR ALEGAVGGAL ADGIRARVAG RVVKADGYIK SAAALPGVFE ANGQDLGSEN
GWALRGTIQA DLGPDGKLDL WVKHSEDNDV ATGGYVFDNC NLQDNGYCGT DAAGLGNGSG
GVINGITGEP ASPFQNFSDT PGVFNRNTNI YQGKLTYDLG GVNLTAITNY TDLRKDYQED
GDALPVEVIV FRTNARYRQF SQELRLAGES ERFRWQAGAY YLDMKIKGGM DTVGAPAIGA
ALAAGLPGVA PTIAETYNLH SKNWSVFGQA EYDLSDKLTV IGGLRYSKDT KTVDYRSAVV
EGAASSLIAT DETFSATLPG ADRISDGDIA ARVTLNYKPA DDTLVFASWN RGIKGGNFTL
NGYVTAQTFQ HRPETLNSFE AGVKWSNPSR TLRVNATAYH YIYNDYQAFA LIGGVPQVGN
SDANATGFEL ETFFQPTDHL NINLGATWER THVDTVQTAG SQFLSVLVPG ASVPQYCTDQ
NDGTYFCDYP TKSVSGAQFP NAPKLSLNYV LRYNVDAFGG NVVAQVDGVW YDKQFLEVTN
GRSSIQPAYN VTNASLSWTS DDDRLSVQVF GRNVFDKAYR AYALNLGPLG TTSVYAKPAT
YGVSATVKW
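
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library. It assumes that translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons. The names are illustrative.

```python
from itertools import product

# Codons in TCAG order, paired with one-letter amino acid codes ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listing above.
print(translate("ATGAAGAATT GCGTGAAGTT CGCGTGCTCC"))  # MKNCVKFACS
print(translate("TATGGCGTCA GCGCCACGGT CAAGTGGTAG"))  # YGVSATVKW
```

The two outputs match the first ten and last nine residues of the protein sequence. The last line of the listing is in frame because the 39 lines before it hold 60 bases each.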