Gene Saro_3995 details


Gene Information

Locus tag: Saro_3995
Symbol:
ID: 5077525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 163015
End bp: 164469
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 64%
IMG OID: 640481100
Product: TraH family protein
Protein accession: YP_001165762
Protein GI: 146275601
COG category:
COG ID:
TIGRFAM ID:
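The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 163015
END_BP = 164469
GENE_LENGTH_BP = 1455
PROTEIN_LENGTH_AA = 484


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```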


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGCCC GTCGCTTCGC CCGCCGCCTC CATGGAGCCT CGCGCCGCGC ACTTGGCACA 
GTCCTCGGTG CCGCGGTTCT GCTCGCTTCG CCCACACCCG CCTCGGCCGG GGTCGAAGGC
GAGATGCAGA GCTTCATGTC CGACATGGGC GTCCAGGCCA ATGTCACCGG TCCCAGCGCC
TACCAGGGCC AGTCGGCGGG CTATTATTCG ATGGGCTCGG TCTGGTCGCG CTTCCCGCAA
AAGAACATCC AGCCCTTCAA CCTCCAGTTG CCCCACGCGC GCGCCGGGTG CGGCGGCATC
GACCTCTTTG CCGGGTCGTT CTCGTTCATC AACACCGCCG AACTTGTCGC CATGCTGAAA
GCGACCGCGA ACAACGCGCT CGGCTTTGCC TTCAAACTCG CGATCGACAC GATCTCGCCC
GAGATCGGCA AGGTCATGGA TGAGCTGGCG CAGAAGGTTC AGCAGATGAA CCAGATGAAC
ATCTCGTCCT GCGAGACCGC GCAGGCGCTG GTCGGCGGCC TCTGGCCGAA GAGCGATACG
GCGAGTTCGG TCATCTGCGA GGCGATCGCC AACAGTCAGG GCGCGGTCTC CGACTGGGCC
CGCGCGCGCC AGCAGTGCAA CAACGGCGGC CAGCGCGAAG CCTTGAAAAG CGCCAATTCC
GACCCGGACA TGAAGGAACA GGCCGGGATG CCCAACAATT ACACCTGGGC GGCATTGGGC
AAGAAATACG GTGGGTTCGA CACCCAGTTC CGCGAGTTCC TGATGACCCT CGTCGGGACC
GTGATCTACG ATCCGGCCGG TAATGGCGGC AAGCCGAGGG TCCAGTTCAT CGGCCCGGCC
GACCCGGCGC TGATCAGCGC CATGCTCGAC GGCACCTCTT CCACCCCGCA CAAATACTGG
AGCTGCGGCG GCGATAGCGC CAAGTGCATG GCGCCGAGCG AGATCGACAT GGTGATCGGG
CCTAATGCAG CGATCAAGGC GCGGGTGCGC ACGCTGATCG AGAGCATGGC CTTGAAAGTG
CGCGATCCCG GCGCCTCGTT GACCCCGGCC GAGATCCAGC TCCTCGGCAT GGCGAGCGTG
CCGGTCTACA AGATCATCAC CGTGAGCGCA GCGGCCGAGT TCGGCATCTC CGCGCAGGAG
ATCAACGACC TTTCCGAAAT AGTCGCGGTC GATCTCGTCA CCACCATGAC CATGCGGTTC
ATCGACATGG CAGTGAACGC GCGTTCGGAC TTCAACGGGG CTGATGCGGA TAGCTTACGC
GAATGGCGCG AGGGGCTTTA CGAGACCCGG CGCAATTTCC TCGGGATCGC GGCGCGCACC
TCGCAGCGCT TCGACCAGAC CTTTGCGCTG ATCCAGCGCA CGCAGATGCT CGAAAAGACC
CTGCGTACCC AGCTCTCGCC CCAGATGTCG GCCGCGCTGC GCTTTTCGCG CACGCTCGGC
AGCCAGGTCC AGTAA
 
Protein sequence
MRARRFARRL HGASRRALGT VLGAAVLLAS PTPASAGVEG EMQSFMSDMG VQANVTGPSA 
YQGQSAGYYS MGSVWSRFPQ KNIQPFNLQL PHARAGCGGI DLFAGSFSFI NTAELVAMLK
ATANNALGFA FKLAIDTISP EIGKVMDELA QKVQQMNQMN ISSCETAQAL VGGLWPKSDT
ASSVICEAIA NSQGAVSDWA RARQQCNNGG QREALKSANS DPDMKEQAGM PNNYTWAALG
KKYGGFDTQF REFLMTLVGT VIYDPAGNGG KPRVQFIGPA DPALISAMLD GTSSTPHKYW
SCGGDSAKCM APSEIDMVIG PNAAIKARVR TLIESMALKV RDPGASLTPA EIQLLGMASV
PVYKIITVSA AAEFGISAQE INDLSEIVAV DLVTTMTMRF IDMAVNARSD FNGADADSLR
EWREGLYETR RNFLGIAART SQRFDQTFAL IQRTQMLEKT LRTQLSPQMS AALRFSRTLG
SQVQ
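
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, assuming the sequences are pasted as whitespace-separated blocks as shown above; `clean`, `translate`, and `gc_content` are hypothetical helper names. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments for translation table 11, listed in the
# conventional T, C, A, G order (first base varies slowest, third fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 15 nucleotides of the gene sequence above.
print(translate("ATGCGCGCCC GTCGCTTCGC CCGCCGCCTC"))  # MRARRFARRL
print(translate("AGCCAGGTCC AGTAA"))                  # SQVQ
```

The two printed fragments match the first ten and last four residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.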