Gene Nwi_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_0022 
SymbolnusA 
ID3676462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp22246 
End bp23862 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content64% 
IMG OID637711557 
Producttranscription elongation factor NusA 
Protein accessionYP_316642 
Protein GI75674221 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0361846 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGCCG TCAGCGCCAA TAAGCTCGAA CTGCTGCAGA TCGCAGACGC GGTGGCGCGC 
GAAAAGACCA TCGACCGCGG CATCGTGATC GCGGCGATGG AGGACGCCAT CGCGAAGGCG
GCGCGGGCGC GGTACGGCAG CGAGACTGAC GTTCATGCGG AGATCCACCC GAAGACCGGA
CAGCTCCAGC TCACCCGCCA CATGCTGGTG GTCGAGCAGG TCGAGAATGC CGCGAACCAG
ATCTCGCTGA AGGACGCCCA GCGGGCCAAT CCCGGCGCAC AGATCGGCGA CACCATCGCC
GATACGCTGC CGCCGCTGGA ATATGGCCGC ATCTCCGCGC AGTCCGCCAA ACAGGTGATC
GTGCAGAAGG TGCGCGAGGC CGAACGCGAC CGGCAGTATC AGGAGTTCAA GGATCGCATC
GGCGATATCG TCAACGGGAT TGTCAAGCGT GTCGAATACG GCAGCGTGAT CGTCGACCTC
GGCCGCGGCG AAGCCGTCAT TCGCCGCGAC GAGATGCTGC CGCGCGAGGT ATTCCGCAAC
GGAGACCGCG TCCGGGCCTA TATCTTCGAC GTGCGCCGCG AAACCCGCGG CCCGCAGATC
TTCCTGTCGC GCACCCATCC GCAGTTCATG GTGAAGCTTT TTACGCAGGA AGTGCCGGAA
ATCTACGACG GCATCGTCGA GATCAAGGCG GTGGCGCGCG ACCCCGGTTC CCGCGCCAAG
ATCGGCGTGG TGTCGCGGGA TTCCTCGGTC GATCCGGTCG GCGCCTGCGT CGGCATGCGC
GGCTCGCGCG TTCAGGCGGT CGTCAACGAG TTGCAGGGCG AAAAGATCGA CATCATCCCG
TGGTCGCCGG ATATCGCGAC CTTCGTCGTC AACGCGCTGG CGCCGGCGGA GGTCGCGAAA
GTCGTGATCG ACGAGGACCG CGAGCGGATC GAGGTCGTGG TCCCCGACAC CAACAACCAG
TTGTCGCTCG CGATCGGACG GCGCGGACAG AACGTGCGGC TCGCCTCGCA ACTGACGGGC
TGGGACATCG ACATCCTGAC CGAGCAGGAG GAATCGGAGC GACGCCAGGC GGACTTCGAG
AATGCGACCC GCATGTTCAT GGAAACACTC AACGTCGACG AGGTCGTCGG GCAGTTGCTG
GCGTCCGAGG GTTTCACCTC TGTTGAAGAA CTGACGCTGG TCGACACGCG GGAAATCGCC
GGCATCGAGG GCTTTGACGA CGAAACCGCG ACCGAATTGC AGAACCGGGC CCGAGAATAT
CTGGAGCAGC TCGAGGCCGA ACTGGAAAAC AGGCGCAAGG AGCTCGGCGT GGATGATGCG
CTGAAGACCG TGCCCGGCGT GACCTCGAAG ATGCTGGTGA AACTCGGCGA AAACGAAGTC
AGGACCATCG AGGATCTGGC CGGCTGCGCC ACCGACGATC TGGTCGGCTG GACCGAACGC
AAGGAGGGCG AAGCGGTCAA GCACGCGGGT TATTTCGACG GCATCGAGAT CTCGCGGGAG
GACGCGGAGG CCATCATCAT GCAGGCTCGC CTGAGTGTCG GCTGGATCAA CGAGGCCGAC
CTCGCGAAAC CGGCGGAGGC TGAGGATATC GCTGCTGAAG ATCAGCCGGC CGAATGA
 
Protein sequence
MAAVSANKLE LLQIADAVAR EKTIDRGIVI AAMEDAIAKA ARARYGSETD VHAEIHPKTG 
QLQLTRHMLV VEQVENAANQ ISLKDAQRAN PGAQIGDTIA DTLPPLEYGR ISAQSAKQVI
VQKVREAERD RQYQEFKDRI GDIVNGIVKR VEYGSVIVDL GRGEAVIRRD EMLPREVFRN
GDRVRAYIFD VRRETRGPQI FLSRTHPQFM VKLFTQEVPE IYDGIVEIKA VARDPGSRAK
IGVVSRDSSV DPVGACVGMR GSRVQAVVNE LQGEKIDIIP WSPDIATFVV NALAPAEVAK
VVIDEDRERI EVVVPDTNNQ LSLAIGRRGQ NVRLASQLTG WDIDILTEQE ESERRQADFE
NATRMFMETL NVDEVVGQLL ASEGFTSVEE LTLVDTREIA GIEGFDDETA TELQNRAREY
LEQLEAELEN RRKELGVDDA LKTVPGVTSK MLVKLGENEV RTIEDLAGCA TDDLVGWTER
KEGEAVKHAG YFDGIEISRE DAEAIIMQAR LSVGWINEAD LAKPAEAEDI AAEDQPAE