Gene Nwi_1641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1641 
Symbol 
ID3675433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1782884 
End bp1783984 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content65% 
IMG OID637713199 
Producthypothetical protein 
Protein accessionYP_318254 
Protein GI75675833 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1509] Lysine 2,3-aminomutase 
TIGRFAM ID[TIGR00238] KamA family protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.123258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.700598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGC GGATCACGAA AGGCATCGCT TCGACCTTGC GGCGTCCCGA AGACCTGATC 
GCTCACGGTC TCGCGCCGGC CGCGGCTCTG GCCGATCTCG AAAAGGTTGC CGCTCGCTAT
GCGATTGCGA TCACGCCTGA GGTGGCGAAC CTGATCGATC CGGCCGATCC TGATGATCCG
ATCGCGCGGC AGTATCTCCC GAGCGCAGAT GAACTGGCGG CGCAAGCGTA CGAACGCGCC
GACCCGATCG GCGATCACGC GCGTTCTCCG GTGGACGGCA TCGTTCATCG TTATCCCGAC
CGGGTGCTGC TCAAGCTCGT TCACGTCTGC GCTGTCTACT GCCGCTTCTG CTTCCGCCGC
GAAATGGTGG GACCCGCCAA GGAAACCGCG CTGTCGAAAT CAGCGACCGC GGCGGCGCTC
GATTACATCC GGTCGCATCC GGAAGTCTGG GAGGTGATCC TGACCGGCGG CGACCCGTTG
ATGCTGTCGC CGCGACGGCT GGCCGAGATC ATGGCCGAGC TGGCCGCCAT CGACCACGTC
AGGATCGTCC GCATCCATAC GCGCGTTCCG GTTGCCGATC CCGCGCGCGT CACCGACGAG
ATGGCGGCGG CCCTGCGCAC GGACGGCGCC ACCACCTGGC TGGCGCTGCA CGCCAATCAT
CCGCGCGAAC TGACAGCGGC GGCGCGAGCG GCCTGCGCGC GCATCATCGA TGCGGGTATT
CCCATGGTCA GCCAGTCGGT GCTGCTGCGC GGCGTCAACG ATGACGCGGC GACGCTGGAG
GCGCTGATGC GCGCCTTCGT CCAATGCCGT ATCAAGCCCT ACTATCTGCA TCACGGCGAT
CTCGCGCCGG GGACGGCGCA TCTGCGGACC ACGCTGGAAC AGGGGCAAGC CCTGATGCGC
GAGCTGCGCG GCCGTGTATC CGGCCTGTGC CAGCCGGATT ATGTCCTCGA TATTCCCGGC
GGATACGGCA AGTCGCCGGT GGGCCCGGGC TATATGTCGC CCTCAGATTT AATCTCCGGA
GCAGGTGAAC ATCGGCCGGA ATTGCACTAT CTTATTACCG ACTACTGTGG CGGCGTTCAT
CTCTATCCGC CGAAGCCATA G
 
Protein sequence
MSARITKGIA STLRRPEDLI AHGLAPAAAL ADLEKVAARY AIAITPEVAN LIDPADPDDP 
IARQYLPSAD ELAAQAYERA DPIGDHARSP VDGIVHRYPD RVLLKLVHVC AVYCRFCFRR
EMVGPAKETA LSKSATAAAL DYIRSHPEVW EVILTGGDPL MLSPRRLAEI MAELAAIDHV
RIVRIHTRVP VADPARVTDE MAAALRTDGA TTWLALHANH PRELTAAARA ACARIIDAGI
PMVSQSVLLR GVNDDAATLE ALMRAFVQCR IKPYYLHHGD LAPGTAHLRT TLEQGQALMR
ELRGRVSGLC QPDYVLDIPG GYGKSPVGPG YMSPSDLISG AGEHRPELHY LITDYCGGVH
LYPPKP