Gene Nmul_A2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2109 
Symbol 
ID3784680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2406689 
End bp2407903 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content55% 
IMG OID637812197 
Producthypothetical protein 
Protein accessionYP_412794 
Protein GI82703228 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.657946 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC TTCGCGTCCC TCATCGCACC AAGTCTGCGC GCGTGGTTCA GGAGCTGGTC 
CGCCTGGCTG CGGGACTTGC CGCCTCTGGC AGCCGCGTAG AAGACGCCTT CTGGGAGAAT
TGCCTTACCG CAAAAATACA TGCCCAATTG CGAACGGGAG ACGATCAGAA TCTCGAATCT
GCACTGGATC AGCTTTATGA CACTGATCCT GGAGGCTACA GTGAATTCAT GTATGCCATC
GAAGCCGCAA TAGAATGCAG TATATTCACG CGAGGGAATC ACACTTACGA CGTTTTGATG
CTTGCTGTAC CGATGCTCAC CTGGTCGCGT TTTTCGATTC CCTCGGGACT AATAGCAGAA
TCCGTGCTTT CTGAACTTCG GGGGCAGTTG CAGGTGCAGG TGTTGGCGGA CGATGCCCAG
CTTGCGCTTG CAGATTGTCT TTTCAGCCCC GATCAATTGC CAAGAGGCTA TCATCTTACA
CATGAACTGG CTCGCAAACT CTGGTCGACG GCGGTGACCG GACAACGCGA CTTGCATATG
AACCCACGGC AGCTCCCTGA AACGGGACGA TTTCTTTCTG ATACCCGTTA TCTCCTCGGC
GGGATCGCGG TACCTCAGGG AAAACCCATG TTCCGCTGGC AGCAAGGCAC TGCGGGCGAA
CACAAGGGCA GCAACGTTTC CGCTCTCCAG TTCTCCAAAG AGCAGATACT GCATGCATGG
CAAACTCACG GTACTGCGGT GCTGCTGCCG CTATTCCAGG GCTGTGCGTT TGAGCTGCTG
ATGCCGGATG GCTTTTTTTC AGCATGGCGC GCCGCGGATC GTCTGGCGCG CCCCTATTCG
GTGCGCGCCA CGGTTGATTT TCTGGAAACC ACTTTGGGTA TTTCCCCCGA CAGATTGCGT
GCAGTGATCG CGCCTTTCTA TGATCAGTGG CTGGAGGAAT ACCGCATTGG ATTCACCCTC
AAGGATCGGG ACAGCGTGCT CTACGGCATC ATCTGGGGAC TGGTAGGGGA TGAGGATGAA
AATACGGATA GTGTCGCTCA GATCGAGGCA GCGCTGCGGG AATGCGGAGT GACACAAACC
ATACTACTGC AGGAGCATTT TCTTTTGGAG TACTGCGAAG AATGCGGGGG ACCCCTGTTT
CCCAACGTGA ATGGTGAAAT CGTGCATGCA GAATTCCCGG AAGAAGGTGA AGTGGCCCCC
ATACACCTGC ATTGA
 
Protein sequence
MKKLRVPHRT KSARVVQELV RLAAGLAASG SRVEDAFWEN CLTAKIHAQL RTGDDQNLES 
ALDQLYDTDP GGYSEFMYAI EAAIECSIFT RGNHTYDVLM LAVPMLTWSR FSIPSGLIAE
SVLSELRGQL QVQVLADDAQ LALADCLFSP DQLPRGYHLT HELARKLWST AVTGQRDLHM
NPRQLPETGR FLSDTRYLLG GIAVPQGKPM FRWQQGTAGE HKGSNVSALQ FSKEQILHAW
QTHGTAVLLP LFQGCAFELL MPDGFFSAWR AADRLARPYS VRATVDFLET TLGISPDRLR
AVIAPFYDQW LEEYRIGFTL KDRDSVLYGI IWGLVGDEDE NTDSVAQIEA ALRECGVTQT
ILLQEHFLLE YCEECGGPLF PNVNGEIVHA EFPEEGEVAP IHLH