Gene Nwi_2020 details


Gene Information

Locus tag: Nwi_2020
Symbol: engA
ID: 3677090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 2210944
End bp: 2212311
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 63%
IMG OID: 637713584
Product: GTP-binding protein EngA
Protein accession: YP_318631
Protein GI: 75676210
COG category: [R] General function prediction only
COG ID: [COG1160] Predicted GTPases
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain; [TIGR03594] ribosome-associated GTPase EngA
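
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the database record.

```python
# Values copied from the record above.
start_bp = 2210944
end_bp = 2212311

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1368, matching "Gene Length"
print(protein_length)  # 455, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1368 bp and protein length of 455 aa.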


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.866056
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTTA CCATTGCTAT CATAGGCCGT CCGAACGTCG GAAAATCGAC GCTGTTCAAC 
CGTCTGGTCG GGCAGAAGCT GGCGTTGGTT GACGACGAGC CGGGCGTGAC GCGCGATCGC
CGCGAGGGGC AGGCGCGTCT CGGCGATCTC GATTTCACGG TGATCGACAC CGCCGGCCTC
GACGAGGGAC CGCGCGGCTC TCTGACGGCG CGCATGCAGG AGCAGACCGA GGCCGCGATT
GCAGCTGCCG ATGCGCTGAT GTTCGTATTC GATGCGCGCG CGGGCCTCAC GCCGACGGAT
CGCTCATTCG CGGATTTCGC GCGCCGCGCC GACAAGCCGG TCGTGCTCGT CGCCAACAAG
AGCGAGGGCA GGCACGGGGA CGCCGGTGCG CTGGAATCCT ACGCGCTCGG GCTCGGCGAT
CCGGTCGGCG TATCCGCGGA ACACGATGAA GGCATGAGCG ATCTTTATGA TGCCTTGCGC
TCGGTGATGC CGGAGCCGGC GGAAGAGGTC GACGAGGAGG AGATCGTCGA GCCCGATATG
TCGCGGCCGA TCCGCGTGGC CATTGTCGGG CGGCCCAACG CGGGCAAATC GACCGTGATC
AATTATCTGC TCAGCGAGGA GCGGCTGCTG ACGAGTCCGG AAGCCGGCAC GACGCGGGAC
TCGATCTCGG TCGAGCTTAA CTGGAAGGGA CGCGATTTCC GCATCTTCGA CACCGCCGGA
TTGCGGCGCA GGTCGCGGAT CGAGGCAAAG CTCGAGAAAT TGTCGGTGGC GGATACGTTG
CGCGCCGTCA GGTTCGCCGA AGCCGTCGTG TTGATGATGG ATGCGCAGAA CAGGTTCGAG
GAGCAGGATC TCCGCATCGC CGATTTGATC GAGCGCGAAG GCCGCGCGCT CGTGATTGCC
GTGAATAAAT GGGACTTGAT GAAGGGCGGT TCGGCGCGGA TCGCCTCGCT GCGCAACGAT
GTCGATCACT GGCTGCCTCA AATCAGGGGT GCTCCGGTGG TCGCGATTTC TGGCCTGACA
GGAGAGGGAA TTGATAGGCT GATGATCGCG ATCCAAACCG CCTATGCCGT ATGGAATCGC
CGTGTCGCAA CGGCGTTGCT CAATCGCTGG TTTCAGCAGG CGGTCGCAGC CAGTCCGCCG
CCCGCGGTCT CCGGTCGTCG GCTGAAGCTC AACTACGCAA CGCAGACCAA GGCGCGTCCG
CCGAGCTTTG TGGTGTTTTG TTCGCGGGCG GATGCCGTTC CGGAATCTTA TCTGCGCTAT
CTGGTCAACA GCTTGCGTGA GACCTTTGAT CTGGCAGGCA CGCCGATCCG GATCACGCTT
CGCGAAAAAG CCAATCCGTT CGCGCATAAG CGCAAGCGCA AGTCATAG
 
Protein sequence
MSFTIAIIGR PNVGKSTLFN RLVGQKLALV DDEPGVTRDR REGQARLGDL DFTVIDTAGL 
DEGPRGSLTA RMQEQTEAAI AAADALMFVF DARAGLTPTD RSFADFARRA DKPVVLVANK
SEGRHGDAGA LESYALGLGD PVGVSAEHDE GMSDLYDALR SVMPEPAEEV DEEEIVEPDM
SRPIRVAIVG RPNAGKSTVI NYLLSEERLL TSPEAGTTRD SISVELNWKG RDFRIFDTAG
LRRRSRIEAK LEKLSVADTL RAVRFAEAVV LMMDAQNRFE EQDLRIADLI EREGRALVIA
VNKWDLMKGG SARIASLRND VDHWLPQIRG APVVAISGLT GEGIDRLMIA IQTAYAVWNR
RVATALLNRW FQQAVAASPP PAVSGRRLKL NYATQTKARP PSFVVFCSRA DAVPESYLRY
LVNSLRETFD LAGTPIRITL REKANPFAHK RKRKS
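
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code, differing only in permitted start codons. It translates the first three 10-base groups and the last line of the gene sequence; the last line starts in frame because the 1320 bases preceding it are a multiple of three.

```python
from itertools import product

# Codon table in NCBI "TCAG" order (assumed identical for tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# Excerpts copied from the gene sequence above.
first_groups = "ATGTCTTTTA CCATTGCTAT CATAGGCCGT"
last_line = "CGCGAAAAAG CCAATCCGTT CGCGCATAAG CGCAAGCGCA AGTCATAG"

print(translate(first_groups))  # MSFTIAIIGR
print(translate(last_line))     # REKANPFAHKRKRKS
```

The two translated fragments match the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.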