Gene Nmul_A1060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1060 
Symbol 
ID3784880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1225735 
End bp1226652 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content53% 
IMG OID637811144 
Productbranched-chain amino acid aminotransferase 
Protein accessionYP_411755 
Protein GI82702189 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase 
TIGRFAM ID[TIGR01122] branched-chain amino acid aminotransferase, group I 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATGG CTGACCGCGA CGGTGTGATC TGGAGTGATG GAAAGATGAT TCCGTGGCGT 
GAGGCTACCA CACATGTACT CACCCACACC CTGCATTATG GGATGGGGGT ATTCGAAGGA
TTGCGTGCCT ACGAAACCCC CCGCGGATCT GCAATTTTCC GTCTGAAAGA ACACACCGAT
CGTTTGTTCA ATTCCGCGCA CATCTTCATG ATGAAGATGC CCTATGACAA GGCGACGCTG
ATACAGGCGC AGTGCGATGT CGTAAGGCAG AACGATCTGA AGTCGTGTTA TATCCGTCCC
ATCGTGTTTT ATGGTTCCGA AGCCATGGGC ATTTCAGCTA AAACGCTTTC GGTGCACGTG
GCTATTGCAG CTTGGGCGTG GGGCACGTAT CTTGGTCCTG ATGGCCTCGA AAAAGGCATC
CGTGTCAAGA CTTCGTCATT TACGCGGCAT CATGTGAATA TCAATATGTG CCGTGCCAAG
TCGGTCACGA CCTATGCAAA TTCCATCCTC GCGCATCAGG AGGTAGCGCA TGATGGCTAT
GATGAGGCGC TGCTTCTCGA TGTGGACGGC TATGTTGCTG AAGGGGCTGG TGAAAACATA
TTCATCGTGA AGCAGGGCAA ATTGTATACG CCTGACTTGA CTTCCTGTCT GGAAGGCATT
ACGCGCGCAT CTCTCATAGA GCTTGCGGAA GAAATCGGAA TCCCGGTTAT CGAGAAGCGC
ATCACCCGCG ATGAAGTCTA TTGCGCGGAT GAAGCCTTTT TCACCGGCAC CGCAGCCGAG
GTAACACCAA TCAGGGAACT GGATAACCGC ACGATCGGCA GCGGCAGGCG TGGTCCTATT
ACAGAAAAGC TCCAGGCCCT CTTTTTTGAA TGTGCCAGAG GCAACGGCAA ACATGCCGAG
TGGCTCACCC ATGTCTGA
 
Protein sequence
MSMADRDGVI WSDGKMIPWR EATTHVLTHT LHYGMGVFEG LRAYETPRGS AIFRLKEHTD 
RLFNSAHIFM MKMPYDKATL IQAQCDVVRQ NDLKSCYIRP IVFYGSEAMG ISAKTLSVHV
AIAAWAWGTY LGPDGLEKGI RVKTSSFTRH HVNINMCRAK SVTTYANSIL AHQEVAHDGY
DEALLLDVDG YVAEGAGENI FIVKQGKLYT PDLTSCLEGI TRASLIELAE EIGIPVIEKR
ITRDEVYCAD EAFFTGTAAE VTPIRELDNR TIGSGRRGPI TEKLQALFFE CARGNGKHAE
WLTHV