Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2962 |
Symbol | |
ID | 3917397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3178547 |
End bp | 3180175 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445740 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_498231 |
Protein GI | 87200974 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.684598 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAT TGAACCTCGA CCGTCGCGGC CTTCTTGGTG CGGGCCTTGT CGGCGCGGCC AGCCTCGGCC TCGGCACGGG CGCCAGCGCG AAGAACCCCG CCCCGCTCGC CCCGCACATG ACCAAGGCCG ATTTCGCGGG CGCGATGAAG GCATTTCGCG GCGTCGTCGG CGCCGAATGG GTGTTCGGCG ACGAGGAGGC CGTTGCGCCC TACACCAAGG TCTACGTGCC CGATCCCGCC AACCGCCATG TGCCGATCGG CGCCGTCTGC CCGGAATCGG TGGAGCAGGT GCAGGAAATC GTCCGCATCG CCAACAAGTA CCGCCAGCCG CTGTGGCCAG TCTCCACTGG CAAGAACATG GGCTATGGCA TGACCGCGCC GGCAACGCCG GGCCAGGTCG TGCTCGACCT CAAGCGGATG AACCGCATTC TCGAGGTCGA CGCGGACCTC GGCACTTGCC TGCTGGAGCC GGGCGTCACC TACCAGCAGC TCAAGGACTA CCTTGTAGAG AACAACATCC CGCTGTGGAT CGACGTGCCG ACAGTGGGCC CGGTGGCCTC GCCGGTGGGC AACACGCTCG ACCGCGGGGT GGGCTACACG CCTTATGGCG AACACTTCAT GTTCCAGTGC GGCATGGAAG TCGTACTCGC CGACGGTCAG GTCATGCGCA CCGGCATGGG CTCGATCAAG GGCAGCACCG CGTGGCAGGC GTTCAAGTGG GGCTACGGCC CTTATCTCGA CGGCCTTTTC ACCCAGTCGA ACTTCGGCGT GGTCACCAAG ATGGGCTTGT GGCTGATGCC CAGGCCCCCG GTCTACAAGC CTTTCATGGT TCGCCATGGC GAGATGGCCG ACGTCCCGCG CATCATCGAG GCGATGCGCC CGCTTCGTGT CTCGAACCTC GTCGCCAATT GCAACCTGAT GATGAGCGCG TCCTACCAGC TTGCCATGTT CAAGCGCCGC AACGAGATCG TCGCTGACGG CGTGCCGCTC GATGATGCCT CGCTCAAGAA GGTGGCCAAG GCCAACGGCC TGGGCATGTG GAACACCTAC TTCGCGCTCT ACGGCACCGA ACAGACCGTC GCGGCGATCG AGCCGATCAT CCGCGCGAGC CTTGTGGCAA GCGGCGGCGA AGTGCTGACC GCCGCCGAGA TGGGCGACAA CCCCTGGTTC CACCACCACG CCACGCTGAT GGAAGGCGGG CTCAATCTCG ACGAGGTCGG CCTGCTGCGC TGGCGCGGTG CGGGCGGTGG CCTCGCCTGG TTCGCCCCCG TCGCCGCCGC GCGAGGGATC GAGGCCGAGC GACAGACCGC GCTCGCCAGG GAAATCCTCG AGAAGCACGG CTTCGACTAT ACCGCCGCCT ACGCCATCGG CTGGCGCGAC CTGCATCACA TCATCGCCCT GCTGTTCGAC AAATCCGATG CCGATCAGGA ACGCAAGGCT GACGCCTGCT ACCGCGAACT GGTCACCCGC TTCGGCGCGC AAGGCTGGGC GAGCTACCGC ACCGGGGTCA ATTCGATGGA CCTCGTCGCG CAGCAGTACG GGCAGGTGAA CCGCGAGTTC AACGCGAAGA TCAAGCATGC CGTCGATCCA AACGGCATCC TTGCTCCCGG CAAATCGGGG ATTGTGTGA
|
Protein sequence | MSELNLDRRG LLGAGLVGAA SLGLGTGASA KNPAPLAPHM TKADFAGAMK AFRGVVGAEW VFGDEEAVAP YTKVYVPDPA NRHVPIGAVC PESVEQVQEI VRIANKYRQP LWPVSTGKNM GYGMTAPATP GQVVLDLKRM NRILEVDADL GTCLLEPGVT YQQLKDYLVE NNIPLWIDVP TVGPVASPVG NTLDRGVGYT PYGEHFMFQC GMEVVLADGQ VMRTGMGSIK GSTAWQAFKW GYGPYLDGLF TQSNFGVVTK MGLWLMPRPP VYKPFMVRHG EMADVPRIIE AMRPLRVSNL VANCNLMMSA SYQLAMFKRR NEIVADGVPL DDASLKKVAK ANGLGMWNTY FALYGTEQTV AAIEPIIRAS LVASGGEVLT AAEMGDNPWF HHHATLMEGG LNLDEVGLLR WRGAGGGLAW FAPVAAARGI EAERQTALAR EILEKHGFDY TAAYAIGWRD LHHIIALLFD KSDADQERKA DACYRELVTR FGAQGWASYR TGVNSMDLVA QQYGQVNREF NAKIKHAVDP NGILAPGKSG IV
|
| |