Gene Nham_1052 details

Gene Information

Locus tag: Nham_1052
Symbol:
ID: 4031658
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 1177948
End bp: 1179297
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 61%
IMG OID: 637969550
Product: Type I secretion membrane fusion protein, HlyD
Protein accession: YP_576360
Protein GI: 92116631
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01843] type I secretion membrane fusion protein, HlyD family
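
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with one stop codon that is not counted in the protein length; neither convention is stated in the record, although both are consistent with the listed values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Gene Information fields of this record
start_bp, end_bp = 1177948, 1179297
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 1350 449
```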


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTCCCCTA GCCAGGCTTC GCTCGCAACC GTTCGCCAAT TCCAGTCGGA GACCGATGCA 
ATTCGGGAGG CCGCCGAGCC CCTGGTCGCG CGCGCCACCC TGTTCGTGCT CTCCGCGTTC
CTGGTTTCGA TCGTCGCGAT CCTGTGCCTG ACGCGGATCG ATCGCGTCAT CACCAGCCTT
AGCGGCAGAA TCGTGCCGGT CGGTCATGTC AATGTCTTGC AGGCGCTGGA TCCATCGATC
ATCAAGACGA TCAACGTGCG CGAGGGCGAG CAGGTGGAAA CGGGGCAGTT GCTCGCCACG
CTCGATGCGA CCTTCACGTC CGCGGATCTG ACACAGGCGA AGTTGCAGGT CGCGAGCCTC
GAGGCGCAGG CGGCGCGCGA TGAGGCCGAA CTGAAGCAGC AACCGCTTGT ATTTGCCGAC
AACCCGGATC CCGACTTCCA AAAATACGCT GCCCTGCAGA AGGCGCTCTA CGGTCAGCGC
GTGGCCCAGT ATACCGCCCA GCTCAACAGT TTCGAGTCGA AGATCAAGGA GACCCAGGCG
ACGATCGAAA AGCTTCGTGA CGACGATGCC CGGTATCGAC AGCGCGACGA GATTTTGCAG
AAGATCGAGA CCATGCGCAC GACGCTTGCC GAGCGTGGCA CGGGATCGCG GCTCAACATG
TACATTTCCC AGGACGCCCG GCTGGAATTG CTGCGGACAC TGGAGAATGC GCACAATGGC
CTGATCGAGG CGAAAAATAC CCTGGGATCG ATAACCGCCG ACCGTGACGC ATTCAAACAG
CAGTGGTTCG CGCAATTGAG CCAGGATTTG GTGACGACGC GCAACAAGCT CGACGAAGCA
AGGGCGACCT ATGAAAAGGC GCTCAAGCAC CAGAATCTCG TGCGCTGGAC GGCTGCCGAC
CCCTCGGTGG TGCTGACGAT GGCACGGCTC TCGGTCGGTT CGGTCCTCAA GCCGGGCGAC
CCCTTCATCA CGCTCATGCC GATCGACACC AAGCTCGAAG CGGAAATCAG GATCTCTTCG
CGCGATGTCG GGTTTATCCG GGCTGGCGAT CCCTGCACCA TGAAGGTCGA TGCCTTTAAT
GCGGCGGTGC ATGGCACGGC GAGTGGAAAT GTGCGCTGGA TCAGCGAGGG CGCGTTCACC
ACGGACGACG ATGGCAAGCC GCTCGACTAC ACCTACTACA AGGCGCGGTG CTCGGTGGAC
GCGTCCAATT TCAAGGATGT GCCGAGCAAT TTTCGTCTGA TTCCAGGCAT GACCCTTCAG
GGGGACATCA ATGTCGGAAC CCGCTCCGTC GCGATGTATC TGCTCGGCGG AATGCTGAAG
GGCTTCCACG AGGCTATGCG TGAACCATGA
 
Protein sequence
MSPSQASLAT VRQFQSETDA IREAAEPLVA RATLFVLSAF LVSIVAILCL TRIDRVITSL 
SGRIVPVGHV NVLQALDPSI IKTINVREGE QVETGQLLAT LDATFTSADL TQAKLQVASL
EAQAARDEAE LKQQPLVFAD NPDPDFQKYA ALQKALYGQR VAQYTAQLNS FESKIKETQA
TIEKLRDDDA RYRQRDEILQ KIETMRTTLA ERGTGSRLNM YISQDARLEL LRTLENAHNG
LIEAKNTLGS ITADRDAFKQ QWFAQLSQDL VTTRNKLDEA RATYEKALKH QNLVRWTAAD
PSVVLTMARL SVGSVLKPGD PFITLMPIDT KLEAEIRISS RDVGFIRAGD PCTMKVDAFN
AAVHGTASGN VRWISEGAFT TDDDGKPLDY TYYKARCSVD ASNFKDVPSN FRLIPGMTLQ
GDINVGTRSV AMYLLGGMLK GFHEAMREP
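
The gene sequence begins with TTG, which normally encodes leucine, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted initiation codons and is read as methionine at the first position. The following is a minimal sketch of that translation, not the database's own pipeline: the function names are hypothetical, the codon table is the standard one written in NCBI's TCAG ordering, and the start-codon set is the table 11 list as published by NCBI.

```python
BASES = "TCAG"
# Standard codon table in NCBI order (first base slowest, third base fastest)
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons permitted by translation table 11
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def codon_to_aa(codon: str) -> str:
    """Look up one codon in the standard table ('*' marks a stop)."""
    i, j, k = (BASES.index(base) for base in codon)
    return AMINO_ACIDS[16 * i + 4 * j + k]


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still read as Met
        else:
            aa = codon_to_aa(codon)
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record
first_line = "TTGTCCCCTA GCCAGGCTTC GCTCGCAACC GTTCGCCAAT TCCAGTCGGA GACCGATGCA"
print(translate_cds(first_line))  # MSPSQASLATVRQFQSETDA
```

The output matches the first 20 residues of the protein sequence listed above.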