Gene Saro_2492 details

Gene Information

Locus tag: Saro_2492
Symbol: nusA
ID: 3916812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2693350
End bp: 2694984
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 66%
IMG OID: 640445248
Product: transcription elongation factor NusA
Protein accession: YP_497762
Protein GI: 87200505
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
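
The start and end coordinates, gene length, and protein length listed above agree with one another, and the check is simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2693350, 2694984)
print(length)                  # 1635
print(protein_length(length))  # 544
```

Both values match the "Gene Length" and "Protein Length" fields of this record.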


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0183695
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAGTG CAATTTCCGC GAACCGCGCC GAACTGCTTG CGATCGCCAA CGCGGTCGCC 
ACCGAGAAGA TGATCGACAA GTCGATCGTC ATCGAGGCGA TGGAAGAGGC GATCCAGAAG
TCCGCGCGCA ACCGCTACGG CGCCGAGAAC GACATTCGCG CCAAGCTCGA CCCGCGCACC
GGCGACCTGC GCCTGTGGCG CGTGGTGGAA GTGGTCGAGG TGGTCGAGGA CTACTTCAAG
CAGGTCGATC TCAAGCAGGC CGAGAAGCTC CAGCCCGGCG CCAAGATCGG CGACTTCATC
GTCGATCCGC TGCCCCCGGT CGATCTCGGC CGCATCGACG CGCAGTCGGC CAAGCAGGTC
ATCTTCCAGA AGGTCCGCGA CGCCGAGCGT GATCGCCAGT ACGACGAGTT CAAGGACCGC
GCCGGCGAAG TCATCACCGG CGTGATCAAG TCGGTCGAAT TCGGCCACGT GATCGTCAAC
CTCGGCCGCG CCGAAGGCGT GATCCGCCGC GACCAGCAGA TCCCCCGCGA AGTGCCCCGC
GTGGGCGAGC GCGTGCGTGC GCTGATCCTC AAGGTCGAGC GCCAGAACCG CGGTCCGCAG
ATCTTCCTGT CCCGCGCGCA CCCCGAATTC ATGAAGAAGC TCTTCGCGCA GGAAGTGCCC
GAGATCTACG ATGGCATCAT CGAGATCAAG GCCGCCGCCC GCGACCCGGG CTCGCGCGCC
AAGATCGGCG TGATCAGCCG CGACAGCAGC ATCGACCCGG TCGGCGCCTG CGTCGGTATG
AAGGGTAGCC GCGTCCAGGC GGTCGTGCAG GAACTGCAGG GCGAGAAGAT CGACATCATC
CCCTGGAGCG AGGACACCGC GACTTTCGTC GTCAACGCTC TCCAGCCCGC GACCGTCAGC
CGCGTGGTGA TCGACGAGGA AGAGAGCCGC ATCGAGGTCG TCGTGCCCGA TGACCAGCTC
TCGCTCGCCA TCGGTCGCCG CGGTCAGAAC GTGCGCCTTG CCTCGTCGCT GACGGGCTCG
GCCATCGACA TCATGACCGA GGCGGAAGCT TCGGAGAAGC GCCAGAAGGA ATTCGCCGAG
CGCTCGAAGA TGTTCGAGGA AGAGCTCGAC GTCGACGAAA CCCTCTCGCA GCTCCTCGTC
GCCGAAGGCT TCACCGAGCT GGAGGAAGTG GCCTACATCG AAATGGCCGA ACTGGCCGCG
ATCGAGGGCT TCGACGAGGA ACTCGCCGAG GAACTGCAGA GCCGCGCGTC CGAGGCGATC
GAACGCCGCG AGGAGGCTCT GCGCGAGCAG CGTCGTGCCC TTGGCGTCGA CGATGCCCTG
GCCGAACTGC CGCACCTGAC CGAAGCGATG CTCGTCGCGC TCGGCAAGGC CGGCATCAAG
ACGCTCGACG ATCTTGCCGA TCTCGCGACC GATGAACTCA TCGCCAAGAA GCGCGCCGAA
CAGCGTCGCC GCAACGACAA GGGCCCGCGC GAGCGGACCG AGCGTTCGGA GCGGACAGAG
GACAAGGGCG GCGTGCTCGG CGAATTTGGC CTGAGCGAAG AACAGGGCAA CGAGATCATC
ATGGCCGCGC GTGCCCACTG GTTCGAAGAC GAACCGGTAG CTGAGGAGGC CGCCGATGCG
GATTCCTCAC AATGA
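
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be stripped before any computation. The sketch below is one possible way to do that and to compute a GC percentage, which the record reports as 66% for the full gene. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used here to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGGCCAGTG CAATTTCCGC GAACCGCGCC GAACTGCTTG CGATCGCCAA CGCGGTCGCC"
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_content(seq))  # GC percentage of this row only, not of the whole gene
```

To compare against the reported value, pass the complete gene sequence to `clean_sequence` instead of a single row.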
 
Protein sequence
MASAISANRA ELLAIANAVA TEKMIDKSIV IEAMEEAIQK SARNRYGAEN DIRAKLDPRT 
GDLRLWRVVE VVEVVEDYFK QVDLKQAEKL QPGAKIGDFI VDPLPPVDLG RIDAQSAKQV
IFQKVRDAER DRQYDEFKDR AGEVITGVIK SVEFGHVIVN LGRAEGVIRR DQQIPREVPR
VGERVRALIL KVERQNRGPQ IFLSRAHPEF MKKLFAQEVP EIYDGIIEIK AAARDPGSRA
KIGVISRDSS IDPVGACVGM KGSRVQAVVQ ELQGEKIDII PWSEDTATFV VNALQPATVS
RVVIDEEESR IEVVVPDDQL SLAIGRRGQN VRLASSLTGS AIDIMTEAEA SEKRQKEFAE
RSKMFEEELD VDETLSQLLV AEGFTELEEV AYIEMAELAA IEGFDEELAE ELQSRASEAI
ERREEALREQ RRALGVDDAL AELPHLTEAM LVALGKAGIK TLDDLADLAT DELIAKKRAE
QRRRNDKGPR ERTERSERTE DKGGVLGEFG LSEEQGNEII MAARAHWFED EPVAEEAADA
DSSQ
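
The protein sequence can be reproduced from the gene sequence using the translation table named in the record (table 11, the bacterial, archaeal and plant plastid code). Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The sketch below builds the codon table by hand so that it has no dependencies. The function name `translate` is illustrative, and only the first and last codons of the gene are translated.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGGCCAGTG CAATTTCCGC GAACCGCGCC"))  # MASAISANRA
print(translate("GATTCCTCAC AATGA"))                  # DSSQ*
```

The two outputs match the start and end of the protein sequence above, with the trailing `*` standing for the TGA stop codon that is not part of the 544-residue protein.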