Gene Nham_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_2014 
Symbol 
ID4031395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp2241144 
End bp2242394 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content61% 
IMG OID637970471 
Productphage integrase 
Protein accessionYP_577272 
Protein GI92117543 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGTATT TTTCGATGCT AACGGATACT GCGATCAGGA AGGCAAAACC GGGCGACAAG 
CCCTTCAAGA TGGCCGACTC TGGTGGCCTT CACCTCTATG TCTCGACCGC GGGCGGCAAG
CTTTGGCGAT TCCGCTATCG GTACGCCGAC AAAGAAAAGC TCCTCACTAT CGGTCCTTAC
CCGGATATCA GCCTCGTTGA TGCCCGTGCG GCCCGAGACG CCGCCAAGGC ATCCTTGCGC
GACGGCCGCG ATCCCGGCGT CATCAAGAAA CTGCGGAAGC TCGCCAACGT CACCAGCACC
GCCAACACGT TCGAAGCGAT CGCCCGCGAA TGGTACGACC TGAATAAGGG CCAATGGGTC
GAGCGCCATG CCGACGATGT ACTCACCAGC CTTGAGCGCG AAGTCTTCCC TGTGCTCGGC
AATATCCCTG TGGCCGATAT CAAGGCGCCC GAAGTCTTGG CTGTGCTGCG CGGTATCGAG
GCTCGCGCGA AGGAAACGGC GCGGCGAGTC CGGCAGCGCA TGTCGGCGGT GTTCGTCTAC
GCAATCAGCT CGGGCCGCGC CGACGCCGAT CCGGCCGCGA CCGTCCAGAA GGCCATGGCT
CCGATGGTCA AGGGACGCCA GCCTGCCATT ACCGATCTGG ATGCCGCCCG CGAGATGCTG
GGGAAAGCCG AAGCCGAGAA AGCGCATCCA GCTACAAAGC TAGCCCTACG AATTATTGCC
CTGACTGTCG TTCGTCCGGG AACGCTCATC ACCACGCCAT GGTCGGAGTG GACCGATATG
GAAGATGGCG TCTGGCGCAT CCCGGCGGCG CGGATGAAAC TGCGGCTGCA ACACAAGGAT
GATGATGCTC GGGACCATTG GGTGCCGCTA TCAAGGCAAG CCTTGGAGGC TGTCGAAGCG
CTGCGCACCC TGACCGGCCG CGGTCCGATC CCCTTCCCGA ACACCCGCCA CGCGCACAGG
ACCATGTCAG AGAACGCCAT CGGGTATCTG TTGAACCGTG CGGGGTATCA CCACCGCCAT
GTCCCCCACG GATGGCGGGC AACATTTTCC AGCGTGATGA ATGAGCGGTT TCCGGCTGAC
AAACCGATTA TCGACCTGAT GCTGGCTCAC GTCCCGAAGG ACAAGGTCGA AGGTGCTTAC
AATCGCGCCC TGCATCTGGA ACGCCGCCGA AAGTTGGCGC AGGAATGGGC GGACTTGATC
TTGAAGGATG CGCGGCCCGC TGCTGATTTG CTGGTCGGAC CGAGGAAGTA G
 
Protein sequence
MGYFSMLTDT AIRKAKPGDK PFKMADSGGL HLYVSTAGGK LWRFRYRYAD KEKLLTIGPY 
PDISLVDARA ARDAAKASLR DGRDPGVIKK LRKLANVTST ANTFEAIARE WYDLNKGQWV
ERHADDVLTS LEREVFPVLG NIPVADIKAP EVLAVLRGIE ARAKETARRV RQRMSAVFVY
AISSGRADAD PAATVQKAMA PMVKGRQPAI TDLDAAREML GKAEAEKAHP ATKLALRIIA
LTVVRPGTLI TTPWSEWTDM EDGVWRIPAA RMKLRLQHKD DDARDHWVPL SRQALEAVEA
LRTLTGRGPI PFPNTRHAHR TMSENAIGYL LNRAGYHHRH VPHGWRATFS SVMNERFPAD
KPIIDLMLAH VPKDKVEGAY NRALHLERRR KLAQEWADLI LKDARPAADL LVGPRK