Gene Acry_0223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_0223 
SymbolnusA 
ID5161504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp257162 
End bp258682 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content66% 
IMG OID640552139 
Producttranscription elongation factor NusA 
Protein accessionYP_001233370 
Protein GI148259243 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATGG ACACCGCCAT AGCCCGCCCC GAACTCCTTC TTGTCGCCGA TGCGGTGGCC 
CGCGAAAAGC AGATCGATCG CGAGGAAGTG CTCGAGGCGA TGGAGCAGGC CATCCAGAAG
GCCGGTCGCG CCAAATATGG CCACGAGAAG GACATTCGGG CGACGATCGA TCGCAAGACC
GGCGATGTCC GCCTTTCGCG CTGGACCGAG GCGGTCGAGA CTGTGGAGAA CGAGGAAACC
CAGATCCCGA TCCATATCGC CCGCAAGTTC AAGCCCGATA TCGAAGTCGG CGGGCATCTG
GTCGATCCTT TGCCGCCGAT CGATTTCGGC CGCATCGCGG CGCAGACGGC GAAGCAGGTG
ATCGTCCAGC GCGTCCGCGA ATACGAGCGC AAGCGCCAGT ACGACGAATA CAAGGACCGT
GTCGGCGAGA TCATCACCGG CGTGGTCAAG CGCACCGAAT ACGGCAACCT CATGGTCGAT
CTCGGCCGCT CGGAAGCCCT GCTCCGGCGC GACGAGACGA TTCCCCGCGA GAACCTGCAC
AATGGCGACC GGGTGCGTGC CTTCATCTAC GACGTGCGCG AGGAACCGCG CGGCCCGCAG
ATCTTTCTCT CACGCACCCA TCCGGGCTTC CTCGCCAAGC TCTTCGCCCA GGAAGTGCCG
GAAATCTACG AGGGGATCAT CGAGATCAAG GCGGTGGCCC GCGATCCTGG CTCGCGCGCC
AAGATGGCGG TGATCAGCCG CGATTCCTCC ATCGACCCGG TCGGGGCCTG CGTCGGCATG
CGCGGCTCGC GCGTGCAGGC GGTGGTGGCC GAACTGCAGG GCGAAAAGAT CGACATCATT
CCGTGGAGCC CGAATCCGGC GACCTTCGTG GTCAACGCGC TCGCCCCGGC CGAGGTCTCG
AAGGTCGTGC TCGACGAGGA GGCCGGCAAG GTCGAGGTCG TCGTGCCCGA CACCCAGCTC
TCGCTCGCGA TCGGCCGGCG CGGCCAGAAT GTCCGCCTTG CCAGCCAGCT TACCCGCTGG
GATATCGACA TCTTGACCGA GGCCGAGGAA AGCGAACGGC GCCAGGAAGA GTTCCGCCGC
CGCTCCGGCC TGTTCGTCGA GGCGCTCGAT GTCGATGACG TGATCGCCGG CCTGCTGGTC
ACCGAAGGGT TCGAGGGCGT CGAGGATCTC GCCGCGACGC CGGTCGAGGA ACTTGCGGCG
ATCGAGGGAT TCGATGAGGG GATCGCCGCC GAACTGCAGC GCCGCGCCGA GGTGGCGCTC
GAGCGCAAGG CCACCGAACT TGAGGACAAG CGGCGCGCGC TGGGTGTCGC CGATGATCTT
GCCGGGCTGG AGGGGCTATC GCCTGCCATG CTGGTGGCGC TCGGCGAGAA GGGTGTGAAG
ACGCTGGACG ATCTTGCCGA TCTTGCCTCT GACGAACTGA TCGAGATCGT CGGCGCCGAT
GCGATGGACG AGGACGCGGC GAATGCCATC ATCATGGCGG CGCGCGCTCA CTGGTTCGAG
GGAGAGGAAG ACGCTGGCTG A
 
Protein sequence
MTMDTAIARP ELLLVADAVA REKQIDREEV LEAMEQAIQK AGRAKYGHEK DIRATIDRKT 
GDVRLSRWTE AVETVENEET QIPIHIARKF KPDIEVGGHL VDPLPPIDFG RIAAQTAKQV
IVQRVREYER KRQYDEYKDR VGEIITGVVK RTEYGNLMVD LGRSEALLRR DETIPRENLH
NGDRVRAFIY DVREEPRGPQ IFLSRTHPGF LAKLFAQEVP EIYEGIIEIK AVARDPGSRA
KMAVISRDSS IDPVGACVGM RGSRVQAVVA ELQGEKIDII PWSPNPATFV VNALAPAEVS
KVVLDEEAGK VEVVVPDTQL SLAIGRRGQN VRLASQLTRW DIDILTEAEE SERRQEEFRR
RSGLFVEALD VDDVIAGLLV TEGFEGVEDL AATPVEELAA IEGFDEGIAA ELQRRAEVAL
ERKATELEDK RRALGVADDL AGLEGLSPAM LVALGEKGVK TLDDLADLAS DELIEIVGAD
AMDEDAANAI IMAARAHWFE GEEDAG