Gene Nham_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_2027 
Symbol 
ID4031408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp2250722 
End bp2251942 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content58% 
IMG OID637970484 
Productphage portal protein 
Protein accessionYP_577285 
Protein GI92117556 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTTGGT TCTCCCGCAA ATCCGCACCT GTTGCGGAAG TGAAATCGGA ATTGACGGAT 
CCCACTTCGG ATAGCTGGGC CGCACTCTGC GATGCCTTCG CTGTGTCTGC GTCGGGTGTT
TCTGTCAATG CTGATAGCGC CATGCGATGC GCGCCGGTCA ATGCCGCAGT CTCAATCCTT
AGCGAGTCCA CCGCAACGTT GCCGTGTCGT CTGTTACGTG ACGATGCCAA TGATAGCGAG
CGCGCAGCGA AGGGTCATCC AGCTTATTCT CTAGTCAACG GTTTCAGCAA TGAGTGGTCA
TCCGCCGCGG ATGTTCGTCG GCGCGTAACG CAAGATGCAA TCTTGTTCGG GGACGGATTC
GCATTGGTGA CGCGCACAGC CGATCGGCCG GCTGAAATCC TTTACGCGCC TCGCAGTGTC
GTCGTGTTGG AATACCAAAC AGACGGTGAG CCGCGCTATC GCATCGACGG CAAGGTCTAT
GGCCCGCGCG ATGTCATCCA TTTGCAAGTG CCGAATCCTA ACGATCCGTT GCAGCAGCGT
GGCCTTGGAC TGCTCCATAC CGGCCGCGAT GCGATCGGCC TAGCGATTTT GCTTGAGCGC
AGTTCCTCGA ATCTGTTCCG CAACAATAGC AAGCCCTCTG GGGTTTTGAG CGTAAAGGGT
AGCCTTAACG CGACCGCAGC CGCTCGCATG GCGACGGCTT GGCGCTCGGC GCATTCCGGT
GACAAGGCGG GCTCAGTCGC GATCATCGAT AACGATGGCA GCTATACCCC GATCAACTTC
ACTTCCGTCG ATAGCCAGAC GGTTGAGCAA CGCGCCTACG CCGTAAGTGA AATCAGCCGA
CTAACTCGCG TCCCGGCAAC GTTGCTTTCC GACATGTCAC GCGCGACGTG GAGCAACAGC
GTCCAGCTTG ACCTTCAGTT CATCAAGTAC GGATTGCAAC CATGGCTCCG CGCCTGGTGC
GATGCTTATG CACGTTGTTT GCTGTCACCC GACGAGCGGA CAGCGATGCA CTTCGAATTC
GACATGTCGC AGCTACTTCT CGCCGACACT GTGGCACGTG CGAATGCGTT TGCACAGTAT
CGATCAGCCG GCGTGATGAC GGCCAACGAT GTGAGGCGCG AGCTAAATCT CCCGCCGCTA
CCAGACGGCG ACGTGCTGGC GTCACCGCAC GTTCAATCCC CAGCGAATGA TAACCAACCT
CCAAAGGACC AGGCAGCTTG A
 
Protein sequence
MGWFSRKSAP VAEVKSELTD PTSDSWAALC DAFAVSASGV SVNADSAMRC APVNAAVSIL 
SESTATLPCR LLRDDANDSE RAAKGHPAYS LVNGFSNEWS SAADVRRRVT QDAILFGDGF
ALVTRTADRP AEILYAPRSV VVLEYQTDGE PRYRIDGKVY GPRDVIHLQV PNPNDPLQQR
GLGLLHTGRD AIGLAILLER SSSNLFRNNS KPSGVLSVKG SLNATAAARM ATAWRSAHSG
DKAGSVAIID NDGSYTPINF TSVDSQTVEQ RAYAVSEISR LTRVPATLLS DMSRATWSNS
VQLDLQFIKY GLQPWLRAWC DAYARCLLSP DERTAMHFEF DMSQLLLADT VARANAFAQY
RSAGVMTAND VRRELNLPPL PDGDVLASPH VQSPANDNQP PKDQAA