Gene Saro_2969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2969 
Symbol 
ID3917404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3188360 
End bp3189616 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content66% 
IMG OID640445747 
Producthypothetical protein 
Protein accessionYP_498238 
Protein GI87200981 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0451316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACCA GGATCCTCGG CGCCGCCGCG ATGGCAGCGC CGCTCGCTCT CGCCCAACCT 
GCGGAAGCTG CGCCCGCGTC TTTCGGCGAG GCTGTCGAAG TCGCCGATGG CACCGTGCTG
GACCCGATAG TCGAGGCACG CCTGCGCTAT GAAGGCGTGG ACCAGCCGAC CACCGACGCC
GACGCCCTGA CCGTGCGGCT GCGCGCGGGG TTCGAGGTGC GTCATGCGCC CTCGCACCTC
TCGTTCCTGG CCGAGGCGGA AGGGACGCTC GGCCTGTGGA ACGACTACAA CGCCTTTCCC
TTCGCGCTTG CCGGCAGTAG CCAAAGGCGA CCGCAGTTCG CCACGGTTCC CGATGCGGAA
AGCATCGATC TCAATCGCTT GCAGGTGCAG TACCGGACGA AGGGCCTCGC GGTCACGGTC
GGGCGTCAGC GCATCAATCT CGACGACCAG CGCTTTGTCG GTTCGGTCGG ATGGCGGCAG
AACGAGCAGA CATTCGATGC GGTGCGGGCC GAGGTCGCGG CAGGGCCGGT CACGTTCGAT
GGCACTTACG CGATCCGGCA GGATTCGATC TTCGGATCGG AGGCCGGGCC GCGCCGCGCG
ATGGACGGGG ACTTCGTGTT CCTCAACGCC GGGGCGAAAA CGGGGGCGGT GACCGCCAAG
GGCTTTGCCT ATCTCATTGA TTATGAAGAG GCCTTTGCCT TCGCCAATTC CTCGCAGACC
TATGGCGGGC GGATCGCGGC CGGCTTCCCC CTGTCGGCCA AGGTCAAGCT CAGCCTCGTC
GGCAGCTATG CCCGGCAGAT GGACATGGGC CGCAACCCGG TCCGCTACCG CGCCGACTAT
CTGCTTGGCG AGGCGGGGCT TTCGTCGCGC GGGTTCACCC TGACGGGCGG CTACGAACGT
CTCGGGGCAG ACGGGACGGC GGGCAAGGCC TTCCAGACGC CGCTGGCGAC GCTGCACAAG
TTCAACGGTT GGGCCGACCT GTTCCTGACG ACGCCCGCCG CCGGGCTCGA AGACCGCTAT
GTCACGCTGG CGAAGGTCTT CCCGAAAGTG AAGGCGCTGC CGGGGCTCAA TGCCATGGTG
ACCTGGCACG ACCTGCGCAG TGATATCGGA AACGCCCGCT ATGGCACCGA ATGGGACGCC
AGTGTCGGGT TCAGGTCCGG CAAGGTCGCA TGGCTGGCCA AATATGCCGA CTACGACGCG
AAGAGCTTCG GTACGGACCG CCGTATCGTG TGGCTCCAGG CTGAAGTCGC ATTCTGA
 
Protein sequence
MKTRILGAAA MAAPLALAQP AEAAPASFGE AVEVADGTVL DPIVEARLRY EGVDQPTTDA 
DALTVRLRAG FEVRHAPSHL SFLAEAEGTL GLWNDYNAFP FALAGSSQRR PQFATVPDAE
SIDLNRLQVQ YRTKGLAVTV GRQRINLDDQ RFVGSVGWRQ NEQTFDAVRA EVAAGPVTFD
GTYAIRQDSI FGSEAGPRRA MDGDFVFLNA GAKTGAVTAK GFAYLIDYEE AFAFANSSQT
YGGRIAAGFP LSAKVKLSLV GSYARQMDMG RNPVRYRADY LLGEAGLSSR GFTLTGGYER
LGADGTAGKA FQTPLATLHK FNGWADLFLT TPAAGLEDRY VTLAKVFPKV KALPGLNAMV
TWHDLRSDIG NARYGTEWDA SVGFRSGKVA WLAKYADYDA KSFGTDRRIV WLQAEVAF