Gene Saro_3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3031 
Symbol 
ID3916643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3242442 
End bp3243542 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content62% 
IMG OID640445811 
Producthypothetical protein 
Protein accessionYP_498300 
Protein GI87201043 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00944915 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC TCGGTCTCTC CATTGCCGCC CTGCTCGCCG GAACGACCGT CGCGAGCGCT 
TCCGCCCAGG CCTCCACGCT GTTCATGGGC TCCTATCCCG ACCGCATGCT GATCGTCGAC
GAGGCATCGG GCAAGGTCAC CGACACGCTG ACGCTCGCCT CCGGCCTGCC GACCTCGATG
CGGATCTCGA ACGACCGGAA GAAGATCTAC GTCACCACGA TCACGACCAG CGGGATTGAG
GTGATCGACA CCGCCACGAA GAAGGTGGTC AACTCCTTCA GCCTGAACAC CCCGACCACG
CGCTATCGCT TCAATGGCGG GGTGCCTGAT CCTTCGGGGC GCTATTTCTA CACGATGCTG
ACGAAGTTCG AGAAGCTGAA CGACCGCTAC CTCGTCAGCC CTCAGCAGTT CGCAGTGATC
GACCTTCAAA AGAAAGCCGT GGTGCGCACG TCCGAAGTGC CCAAGGAAGA TGACAGCAAC
CCCAACGCCG GCTGGCGCAC CAACTACATG ATGTCCGAGG ACGGCAAGAC CTTGTTCGTG
ATCCGCGACA AGGTGCTCGT GCTCGACACC GCCGACCTCA AGGTCAAGGA GCGGATCGAG
GTTTCGCGCC CCGAGGCCAC CGGTATCGAG GGCGTGACCT TCGGCGGCGG GGTCGAAGCG
CTGCGAAACC CGCACGAATA CGTCTCGCTG TTCAACGCGA CCGATCCCTA CATTCACAAC
AAGATCTTCG GCGTCGGGCG CTTCAACCTG GCGACCAAGG CCTTCGACTT CCGCCCGATC
GGCCCCGCGC CCTATGGCAT GGCCGGCCTG CAGGTCTCTC CCGACCTCAA GCAGGGCTGG
ACGGTCGTCA CCAACGGCAG CGTGGGCAAC AAGCGGTGCG AATTCTGGCA TCTCGACCTC
ACCACCAACC AGGTGAAGAA CAAGGCCGAA TTCCCCTGCC GTTCGCGCTT CCAGTTCGGC
ATGTCGGGCG ACGGCACGAA GCTCTACATC TACGGCGCCA GCTACGACAT CGAGATCTAC
GACGCACAGA CTCTGGCGCA CGAAAAGACG GTCGATCTCG GCGCGGACTC GACCGGCGCC
GGGATGATAA TCACCCAGTG A
 
Protein sequence
MKKLGLSIAA LLAGTTVASA SAQASTLFMG SYPDRMLIVD EASGKVTDTL TLASGLPTSM 
RISNDRKKIY VTTITTSGIE VIDTATKKVV NSFSLNTPTT RYRFNGGVPD PSGRYFYTML
TKFEKLNDRY LVSPQQFAVI DLQKKAVVRT SEVPKEDDSN PNAGWRTNYM MSEDGKTLFV
IRDKVLVLDT ADLKVKERIE VSRPEATGIE GVTFGGGVEA LRNPHEYVSL FNATDPYIHN
KIFGVGRFNL ATKAFDFRPI GPAPYGMAGL QVSPDLKQGW TVVTNGSVGN KRCEFWHLDL
TTNQVKNKAE FPCRSRFQFG MSGDGTKLYI YGASYDIEIY DAQTLAHEKT VDLGADSTGA
GMIITQ