Gene Nmul_A1030 details


Gene Information

Locus tag: Nmul_A1030
Symbol: xseA
ID: 3785157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1191375
End bp: 1192718
Gene length: 1344 bp
Protein length: 447 aa
Translation table: 11
GC content: 57%
IMG OID: 637811114
Product: exodeoxyribonuclease VII large subunit
Protein accession: YP_411725
Protein GI: 82702159
COG category: [L] Replication, recombination and repair
COG ID: [COG1570] Exonuclease VII, large subunit
TIGRFAM ID: [TIGR00237] exodeoxyribonuclease VII, large subunit
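
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with that reading) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 1191375
END_BP = 1192718
GENE_LENGTH_BP = 1344
PROTEIN_LENGTH_AA = 447


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```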


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value: 0.000189378
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCACG TTCTGGAAAT GGGTTTCGAG CGGGCGGTCA TGAGCGTAAG CGAATTGAAC 
CGCAATGCCA AGGAGTTGCT GGAGCAGGCT TTTCCATTAT CCTGGGTGGC TGGCGAGATC
TCCAATATCA AGTGCTATGG CTCCGGCCAC TGGTATTTTT CCCTGAAGGA TGAGATTGCT
CAGGTGCGCT GTGTCATGTT CCGGGAAAAA AACCAGTATC TTGATTGGCA GCCTCGGGAT
GGCATGCGGG TGGAAGTGCG CGCCCTGGTA ACGCTATATC ATGCGCGCGG CGATTTCCAA
CTGAACATCG AGACTATCCG CCACGCCGGA CTCGGTTCGC TGTTTGAAGC TTTCGAGCAA
CTCAAGGCGA GGCTTGGAAA AGAGGGGTTG TTCGATTCTG AGCGCAAGAA ACCATTGCCA
GAGTTTCCAA AGCAGATCGG GATCATCACT TCTCCTGCCG CCGCGGCGCT GCATGATGTG
CTGTCCACCT TGCAGCGGCG TATGCCTTCC GTGCCCATAA TCGTTTATCC AACGATTGTT
CAGGGCGCTG GCGCTTCAGT AAGGATCGCG GGCGCCATTC AAACCGCTGC AAGCCGGGCT
GAGTGCGATG TACTGATACT GTGCCGCGGG GGTGGCTCTC TGGAAGATCT GTGGGCTTTC
AACGAGGAGG TTGTGGCACG CGCAATCGCG GCTTGTTCCA TTCCCATTGT CAGTGGAGTG
GGTCACGAAA CCGATTTTAC CATTGCGGAT TTTGTTGCGG ATGTCCGCGC GCCCACGCCG
ACTGGCGCAT CCCAGCTCGT GTGCCCGGAT CGCGCAGAAG TGGCGAGATG CGGAGAAATT
CTTCGTGGAC GCATGTACCG CGCGATGCAA CGGCGCATCG AAAGCCGGAT GCAGCATACG
GATATGCTGG GGTGCCGTCT GGTACATCCG GGAAAGCGCA TCGAAGCACA ACTGGCGCAG
CTTGCGCGTT TGCGCGAACG CCTGGAAAGC GCATGGCTAC GTCACGCGAA AGAGAGGCAC
TGGCGCTTGC GCGAGCTCCA GCAGCGCATG AAGATTGCCC GGCCCGACAT CCCACGGCTG
GAAGGGCGCC AGCAGCAACT CGGTCTACGC CTTCAGCGGG CGATCGCATC CCGGATTGAA
ACTCTCGGCA TGCACTTGCA GCGCAGGGAA GCAAATCTTT CCCATCTGAA TCCGGATTCC
GTTCTGGCGC GAGGCTACAG TATTGCTTAT ACCTCCGATG GCACGGTATT GAGAAGAAAT
GATCAGGTCG ATGTTGGCGA TGTCATCCGC GTGACGTTTG CGAAAGGATG GAGCAAGGCG
TCCGTGATGG AGAAGGGCGA GTAG
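
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and how a GC percentage like the one in the record might be computed. The names `clean_sequence` and `gc_percent` are illustrative, and the exact method the database used for its GC figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAATCACG TTCTGGAAAT GGGTTTCGAG CGGGCGGTCA TGAGCGTAAG CGAATTGAAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases: six blocks of 10
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the rounded GC content listed in the record.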
 
Protein sequence
MNHVLEMGFE RAVMSVSELN RNAKELLEQA FPLSWVAGEI SNIKCYGSGH WYFSLKDEIA 
QVRCVMFREK NQYLDWQPRD GMRVEVRALV TLYHARGDFQ LNIETIRHAG LGSLFEAFEQ
LKARLGKEGL FDSERKKPLP EFPKQIGIIT SPAAAALHDV LSTLQRRMPS VPIIVYPTIV
QGAGASVRIA GAIQTAASRA ECDVLILCRG GGSLEDLWAF NEEVVARAIA ACSIPIVSGV
GHETDFTIAD FVADVRAPTP TGASQLVCPD RAEVARCGEI LRGRMYRAMQ RRIESRMQHT
DMLGCRLVHP GKRIEAQLAQ LARLRERLES AWLRHAKERH WRLRELQQRM KIARPDIPRL
EGRQQQLGLR LQRAIASRIE TLGMHLQRRE ANLSHLNPDS VLARGYSIAY TSDGTVLRRN
DQVDVGDVIR VTFAKGWSKA SVMEKGE
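
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard amino acid ordering over the bases T, C, A, G. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as initiators, which does not matter here because this gene starts with ATG. The `translate` helper is illustrative only and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the last line of the gene sequence in the record.
print(translate("ATGAATCACG TTCTGGAAAT GGGTTTCGAG"))  # MNHVLEMGFE
print(translate("TCCGTGATGG AGAAGGGCGA GTAG"))        # SVMEKGE
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.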