Gene Csal_3075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3075 
Symbol 
ID4028879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3425829 
End bp3427322 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content63% 
IMG OID637968287 
ProductNusA antitermination factor 
Protein accessionYP_575118 
Protein GI92115190 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAG AGATTCTGCT GGTCGTAGAT GCGATCTCGA ACGAAAAGGG GGTGCCGCGT 
GAGGTGATCT TCGAGGCCGT GGAGTCGGCG CTGGCGAGTG CATCGCGTAA GCGCTTCGAT
GGCCAGGAAG TGAGCACTCG CGTCAAGATC GATCGCGCGA CCGGTGATTA CGACACCTTC
CGTCGCTGGA CTGTCGTCGA GGACGAGGCG TTCGAGAATC CGGATAGCGA AATCCCGCTG
TCCGAGGCGG AGCGTCGCGA TCCGCCCCTG GCGCTGGGTG ACGTGGTCGA GGAGCAGGTC
GAGTCGGTCG CATTCGGTCG CATCGCCGCC CAGACCGCCA AGCAGGTCAT CGTGCAGAAG
GTGCGCGAGG CCGAGCGTGC CGAGGTCGTC CGCCAATACG CCGAGCGTGA AGGCGAACTG
GTGGCCGGTA TCGTCAAGAA GACCACGCGC GAAGGATTGA TCGTCGACCT GGGCGAGAAT
GCCGAAGCGT TCCTGCCGCG CAGCGAGATG ATCCCCGGCG AGCGCTATCG CATGAACGAA
CGCGTGCGTG CCCTGTTGTG GAAGGTCGAC GCCGAGGCGC GCGGGTCGCA GTTGATCCTG
ACGCGGACGC GCCCCGACAT GATCGTCGAG CTCTTCAAGA TCGAGGTGCC CGAGATCGCC
GAGCAGCTCA TCGAGATCAA GGGCGCGGCG CGCGATCCCG GCGCTCGCGC CAAGATTGCA
GTCAAGACCA ATGACAAGCG CATCGACCCG GTCGGTGCGT GTGTCGGCAT GCGCGGGTCG
CGCGTGCAAG CGGTGTCCAA CGAGCTGCGC AACGAGCGCG TGGACATCAT CATGTGGGAC
GACAACCCGG CGCAACTGGT GATCAATGCC ATGGCGCCCG CGGAAGTCGG CTCCATTCTC
GTCGACGAAG ACGCCCATTC CATGGATGTC GCCGTCGCCG AGGACAACCT GGCGCAGGCC
ATCGGCCGCA GCGGCCAGAA CGTTCGCCTG GCTTCCGAGC TGACGGGCTG GACGCTCAAC
GTGATGACCG AAGAAGAGGC CGAAGGCAAG CGCGAGCAAG AAATCGACAG CCTGATCGAG
TACTTCATCA ATCACCTCGA AGTCGACGAG GAACTCGCCC GTCTGCTGGT CGAGGAAGGC
TTTACCTCGC TCGAGGAGCT GGCGTATGTG CCGCTCGAAG AGTTGCTCGA GATCGAGGAA
TTGGATGAAG CACTGATCGA AACGCTGCGC GCTCGAGCCA AGGACGAATT GCTGACGCTG
GCCATCGCTT CCGAAGAGGC GCTGGACGGT GCGCAGCCCG ATGACGACCT CCTTGAAATG
GAGGGCATGG ATCGTCACTT GGCATTTACG CTCGCCAGTC GAGGCATCGT CACGCGTGAG
GACTTGGCCG AGCAGTCCAT CGATGATCTC AAGGACATCG ACGATGTGGA CGAAGAGCGC
GCCGCGGCGC TGATAATGAC CGCTCGTGCG CCTTGGTTCG AGAGCGAACA GTAA
 
Protein sequence
MSKEILLVVD AISNEKGVPR EVIFEAVESA LASASRKRFD GQEVSTRVKI DRATGDYDTF 
RRWTVVEDEA FENPDSEIPL SEAERRDPPL ALGDVVEEQV ESVAFGRIAA QTAKQVIVQK
VREAERAEVV RQYAEREGEL VAGIVKKTTR EGLIVDLGEN AEAFLPRSEM IPGERYRMNE
RVRALLWKVD AEARGSQLIL TRTRPDMIVE LFKIEVPEIA EQLIEIKGAA RDPGARAKIA
VKTNDKRIDP VGACVGMRGS RVQAVSNELR NERVDIIMWD DNPAQLVINA MAPAEVGSIL
VDEDAHSMDV AVAEDNLAQA IGRSGQNVRL ASELTGWTLN VMTEEEAEGK REQEIDSLIE
YFINHLEVDE ELARLLVEEG FTSLEELAYV PLEELLEIEE LDEALIETLR ARAKDELLTL
AIASEEALDG AQPDDDLLEM EGMDRHLAFT LASRGIVTRE DLAEQSIDDL KDIDDVDEER
AAALIMTARA PWFESEQ