Gene Saro_1028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1028 
Symbol 
ID3915810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1065625 
End bp1067019 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content68% 
IMG OID640443762 
ProductXRE family transcriptional regulator 
Protein accessionYP_496307 
Protein GI87199050 
COG category[R] General function prediction only 
COG ID[COG3800] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTCCC GACCGCTCTA TCTCGGCCCC CGGCTGAAGC GCATCCGGCG CGAACTGGGA 
CTTACCCAGC AGGCCATGGC GGAGGAGCTG GGGATTTCGC CCAGTTACAT CGCGCTGATC
GAACGAAACC AGCGACCGCT GACCGCCGAC CTTCTGCTGC GCCTGGCGCG GGCCTACAAA
CTGGACATGG CAGACCTCGC CGCGGACGAA CGCGACGACT ATGCGCGGCG TCTTTCCGAT
GCACTGCGCG ATCCGATCTT CTCGGACATC GACCTGCCGG CGCTGGAAGT GGCCGACGTC
GCCGCGAGCT TTCCCGGCGT GACCGAGGCA ATGTTGCGGC TTTACGGTGC ATGGCAGCGC
GAACAGCAGG CGTTGGCCGA ACAGCGCAAC GCCGGGCCGG GCACGAGCGC AAGCGACCCG
ATCGGCGAAG CGCGGCGGTT CGTGGCCGCG CGGCGCAACT ACTTCCCGAC CATAGATTCG
CGGGCAGAGG AACTGGCGTC GGAAATCGAC AAAGCGGGCG GGCCGCAGGA ATGGCTCCGC
AAGCAGGGCG TGCGCGTGCG CTTCCTGCCG CCCGACGTGA TGATGGACGC GGTGCGGCGC
TATGACCGGC ACAACGAGCA ACTGCTGATC GACGACACGC TGCCACTGTC CAGCCGCGTC
TGGCAGTTGC TGCAACACAT CGCCTACACC GCGATACGCT CAGAGATCGC GGCGGTGATC
CGGGGGGAAA GCTTTGCCAG CCAGACCGCT GCGAACCTCG TGCGCAGGGC GCTGGCGAGC
TATGCCGCAG CGGCCATCGC CATGCCTTAT GACCGGTTTG CGCGAGCCGT GGATGCGAGG
CGGTACGACA TCGAGGCGCT GTCCGGCCAG TTCGGCACCA GTTTCGAGCA GGTGGCGCAC
CGGCTGACGA CGCTCAACCG CCCGGGGCAG GAGCGAGTGC CGTTCTTCTT CATACGGGTC
GATTCGGCGG GCAACGTGTC CAAGCGGCTC GACGGTGCCG GCTTCCCGTT CGCGGCGCAT
GGCGGCGGTT GCCCGCTGTG GAGCGTGCAC GACACCTTCC GCACGCCGGG GCAGATCGTC
ACGCAATGGC TTGAACTGCC CGACGGACAG CGCTTCTTCT CGCTTGCGCG CACGGTGACG
TCTGGCGGCG GCGGTTTCGA CCGTCCGAGG ATGACGAGGG CCATCGCGCT GGCCTGTGCA
GCCGAACACG CCCCGAGGCT GGTCTATGCG GCGGGGGGCG ATCCAAGGGC GGTTGCGGCA
ACGCCGATCG GAGTGACCTG CCGCCTATGC CACCGCGCCC AGTGCACTGC ACGCGCCGAA
CCTCCCATCG GACGCGAGAT CCTACCCGAC GACTATCGGC GTGGGGCCGA GCCGTTCAGC
TTTGCGGAGA GCTGA
 
Protein sequence
MASRPLYLGP RLKRIRRELG LTQQAMAEEL GISPSYIALI ERNQRPLTAD LLLRLARAYK 
LDMADLAADE RDDYARRLSD ALRDPIFSDI DLPALEVADV AASFPGVTEA MLRLYGAWQR
EQQALAEQRN AGPGTSASDP IGEARRFVAA RRNYFPTIDS RAEELASEID KAGGPQEWLR
KQGVRVRFLP PDVMMDAVRR YDRHNEQLLI DDTLPLSSRV WQLLQHIAYT AIRSEIAAVI
RGESFASQTA ANLVRRALAS YAAAAIAMPY DRFARAVDAR RYDIEALSGQ FGTSFEQVAH
RLTTLNRPGQ ERVPFFFIRV DSAGNVSKRL DGAGFPFAAH GGGCPLWSVH DTFRTPGQIV
TQWLELPDGQ RFFSLARTVT SGGGGFDRPR MTRAIALACA AEHAPRLVYA AGGDPRAVAA
TPIGVTCRLC HRAQCTARAE PPIGREILPD DYRRGAEPFS FAES