Gene Saro_2962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2962 
Symbol 
ID3917397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3178547 
End bp3180175 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content66% 
IMG OID640445740 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_498231 
Protein GI87200974 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.684598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAT TGAACCTCGA CCGTCGCGGC CTTCTTGGTG CGGGCCTTGT CGGCGCGGCC 
AGCCTCGGCC TCGGCACGGG CGCCAGCGCG AAGAACCCCG CCCCGCTCGC CCCGCACATG
ACCAAGGCCG ATTTCGCGGG CGCGATGAAG GCATTTCGCG GCGTCGTCGG CGCCGAATGG
GTGTTCGGCG ACGAGGAGGC CGTTGCGCCC TACACCAAGG TCTACGTGCC CGATCCCGCC
AACCGCCATG TGCCGATCGG CGCCGTCTGC CCGGAATCGG TGGAGCAGGT GCAGGAAATC
GTCCGCATCG CCAACAAGTA CCGCCAGCCG CTGTGGCCAG TCTCCACTGG CAAGAACATG
GGCTATGGCA TGACCGCGCC GGCAACGCCG GGCCAGGTCG TGCTCGACCT CAAGCGGATG
AACCGCATTC TCGAGGTCGA CGCGGACCTC GGCACTTGCC TGCTGGAGCC GGGCGTCACC
TACCAGCAGC TCAAGGACTA CCTTGTAGAG AACAACATCC CGCTGTGGAT CGACGTGCCG
ACAGTGGGCC CGGTGGCCTC GCCGGTGGGC AACACGCTCG ACCGCGGGGT GGGCTACACG
CCTTATGGCG AACACTTCAT GTTCCAGTGC GGCATGGAAG TCGTACTCGC CGACGGTCAG
GTCATGCGCA CCGGCATGGG CTCGATCAAG GGCAGCACCG CGTGGCAGGC GTTCAAGTGG
GGCTACGGCC CTTATCTCGA CGGCCTTTTC ACCCAGTCGA ACTTCGGCGT GGTCACCAAG
ATGGGCTTGT GGCTGATGCC CAGGCCCCCG GTCTACAAGC CTTTCATGGT TCGCCATGGC
GAGATGGCCG ACGTCCCGCG CATCATCGAG GCGATGCGCC CGCTTCGTGT CTCGAACCTC
GTCGCCAATT GCAACCTGAT GATGAGCGCG TCCTACCAGC TTGCCATGTT CAAGCGCCGC
AACGAGATCG TCGCTGACGG CGTGCCGCTC GATGATGCCT CGCTCAAGAA GGTGGCCAAG
GCCAACGGCC TGGGCATGTG GAACACCTAC TTCGCGCTCT ACGGCACCGA ACAGACCGTC
GCGGCGATCG AGCCGATCAT CCGCGCGAGC CTTGTGGCAA GCGGCGGCGA AGTGCTGACC
GCCGCCGAGA TGGGCGACAA CCCCTGGTTC CACCACCACG CCACGCTGAT GGAAGGCGGG
CTCAATCTCG ACGAGGTCGG CCTGCTGCGC TGGCGCGGTG CGGGCGGTGG CCTCGCCTGG
TTCGCCCCCG TCGCCGCCGC GCGAGGGATC GAGGCCGAGC GACAGACCGC GCTCGCCAGG
GAAATCCTCG AGAAGCACGG CTTCGACTAT ACCGCCGCCT ACGCCATCGG CTGGCGCGAC
CTGCATCACA TCATCGCCCT GCTGTTCGAC AAATCCGATG CCGATCAGGA ACGCAAGGCT
GACGCCTGCT ACCGCGAACT GGTCACCCGC TTCGGCGCGC AAGGCTGGGC GAGCTACCGC
ACCGGGGTCA ATTCGATGGA CCTCGTCGCG CAGCAGTACG GGCAGGTGAA CCGCGAGTTC
AACGCGAAGA TCAAGCATGC CGTCGATCCA AACGGCATCC TTGCTCCCGG CAAATCGGGG
ATTGTGTGA
 
Protein sequence
MSELNLDRRG LLGAGLVGAA SLGLGTGASA KNPAPLAPHM TKADFAGAMK AFRGVVGAEW 
VFGDEEAVAP YTKVYVPDPA NRHVPIGAVC PESVEQVQEI VRIANKYRQP LWPVSTGKNM
GYGMTAPATP GQVVLDLKRM NRILEVDADL GTCLLEPGVT YQQLKDYLVE NNIPLWIDVP
TVGPVASPVG NTLDRGVGYT PYGEHFMFQC GMEVVLADGQ VMRTGMGSIK GSTAWQAFKW
GYGPYLDGLF TQSNFGVVTK MGLWLMPRPP VYKPFMVRHG EMADVPRIIE AMRPLRVSNL
VANCNLMMSA SYQLAMFKRR NEIVADGVPL DDASLKKVAK ANGLGMWNTY FALYGTEQTV
AAIEPIIRAS LVASGGEVLT AAEMGDNPWF HHHATLMEGG LNLDEVGLLR WRGAGGGLAW
FAPVAAARGI EAERQTALAR EILEKHGFDY TAAYAIGWRD LHHIIALLFD KSDADQERKA
DACYRELVTR FGAQGWASYR TGVNSMDLVA QQYGQVNREF NAKIKHAVDP NGILAPGKSG
IV