Gene Nmul_A0743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0743 
Symbol 
ID3786567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp866878 
End bp867939 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content55% 
IMG OID637810825 
Producthypothetical protein 
Protein accessionYP_411442 
Protein GI82701876 
COG category[S] Function unknown 
COG ID[COG5563] Predicted integral membrane proteins containing uncharacterized repeats 
TIGRFAM ID[TIGR02913] probable extracellular repeat, HAF family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.890074 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTCTTGA GTACCGGATT GTGCTTTGTT TCCCCCGCCT TTGCACAGCT ACAACGTTTA 
TACATTCTGG ATCTCAAAAG AAACCAACTG ATCACTCTTA TGCCGAGTGA AATGACGGGT
AACGCTACCG GCATCAATAA TGCTGGCCAG GTAGCATGGT ACGCCATCAA GGAATATGGG
CACGCTTTCA TCACCGGCCC CGACGGCGTT GGCAGAACCG ATCTCAAGAC ACTGGGCGGT
GATTCCAGCG CAACCTTTGG CATTAATGAT GCGGGGCAAG TGGTAGGATG CTCACAGACA
GCTACAGGTA ATTTGCATGC TTTTATTACC GGCCCCAACG GCGTGGGCAT GACAGACCTG
GGGACACTGG GAGAACGTGC CAGTTACGCC ACCAGCATCA ATAATGCTGG CCAGGTGGCA
GGATATACCG TCAAGGAATA TGGGCACGCT TTCATCACTG GTCCCAATGG TGCGGGCATG
ACCCATCTGG AAAGCCTGCC GGACGGACTC ACAACTGTTG CTCATGATAT CAATGATGCC
GGACAGGTGG TCGGGAGGGG CGTGAGGCAC GCTTTCATCA CTGGCCGCAA TGGCGTGGGG
ATGAAGGATC TCGGGACCCT GGGTGGAGAT TACAGTGTCG CCTATGGCAT CAATGAGGCC
GGAGAGGTGG TAGGGGGTTC CAGCACGGCT GCCGGTTATA CGCACGCTTT CATCACTGGT
CCCAATGGCG TGGGGATGAC GGATCTCGGG ACGCTGGGTG GAGGTTACAG TATCGCCACT
GACATCAACG ATGCCGGACA GGTGGTGGGA TGGTCCACCA TGGCTGCCGG TGATAACCAT
GCCTTTATCA CCGGTCCCAA TGGCGTAGGC ATGATGGACC TTAATTCACT GCCCTGGTTA
CCGCCCGGAT ACGTTATAAC GGGCGCCATC AGCATCAATG ACAGGGGACA GGTCGTCGTT
ATTGCCGATC TTCCCAAACC CGAGGCTTAT ATGCTGATAT TCGCTGGCCT GGGCCTGATC
GGGTTTCTGG TATGGTGGCA GAAGTCGGGA AGCAGCGCCT AG
 
Protein sequence
MVLSTGLCFV SPAFAQLQRL YILDLKRNQL ITLMPSEMTG NATGINNAGQ VAWYAIKEYG 
HAFITGPDGV GRTDLKTLGG DSSATFGIND AGQVVGCSQT ATGNLHAFIT GPNGVGMTDL
GTLGERASYA TSINNAGQVA GYTVKEYGHA FITGPNGAGM THLESLPDGL TTVAHDINDA
GQVVGRGVRH AFITGRNGVG MKDLGTLGGD YSVAYGINEA GEVVGGSSTA AGYTHAFITG
PNGVGMTDLG TLGGGYSIAT DINDAGQVVG WSTMAAGDNH AFITGPNGVG MMDLNSLPWL
PPGYVITGAI SINDRGQVVV IADLPKPEAY MLIFAGLGLI GFLVWWQKSG SSA