Gene Nham_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_2020 
Symbol 
ID4031401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp2245282 
End bp2246910 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content58% 
IMG OID637970477 
Productphage terminase 
Protein accessionYP_577278 
Protein GI92117549 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCAA AGAGTACGTT CCCCGAGTGG ATATATGACG GCTCGGAGAT CCCCGATCCA 
TTCGGCTACG GCGAGCGTGC CGTTACGTTT CTTCGGCGGC TGAAGCATCC GAAGTCGACG
TTGCCGGGCA GGGCGTTCCA GCTTGATCCG TGGCAAGAGC GTATCGTGCG CCGCATTTAC
GGCCCGCGGC ATGATGACGG GCGGCGCATC GTCAACACCG TCACCATGCT ACTGCCAAGG
GGAAACCGTA AGACGAGCCT TGGCGCGGCG CTGGCTTTGC TGCACACGAT CGGCCCCGAG
CGTATGCCCG GTAGCGAAGT CATTTTTTCG GCTTCGGATC GTAAGCAATC CGGCATCGCC
TTCAAAGAGG CGCGCGGCAT AGTGCAGGCC GATAAGCGGC TTGTGAAAGC GACGAAGGTT
TACGACGCCT TCAACAGCGC AAAGAAAATT GCCTACCCGA AGGATAGTGT CGAGCTGGAG
ATTATCTCGG CCGACGCGCC ATCGTCTGAA GGCCGCACGC CTGCTTTTGT GCTCGCTGAC
GAGACGCATA TCTGGCGCGG CAAGGATTTA TGGACTGTTC TGACCAACGG CCTTGATAAG
ATCGATAACA GCTTGCTTGT CGTCACGACC ACCGCAGGCC GCGGCACCGA CAATATCGGT
TATGAGATTA TCGACCGGGC ACGAAAGATT GCGCGCGGCG AGATCGTCGA CCCGACCGTG
TTGCCGGTGT TGTTCGAGGC CGAACCGGAC TGCGATTACA CAAGCGAGGA AGTTTGGCGG
CGCGTTAATC CGGGTAGTCC GCACGGCTAT CCCTCGATTG AGGGATTTCG CCGTCACGTC
AAACGCGCTC AAGACAATCC GACCGAGCGC AGCAGCTTGA AGCGATACAA ACTCAACATC
TGGGAAGATA GCAGTTCGTC GCCGTTCGTC GACATGCTCG TTTATGACGA GGGCGCTGGC
GAGATTGATA CCGCTATCCT TGACGGCGAG CCATGTTGGC TGGGCGTTGA CCTTAGTTCT
AGTATCGATC TTTCAGTCGT TATCGCGTGC TTCCGTGACG GCGACGACTA CATCGTGCAG
CCGCACTTCT TTTGTCCGCA GGACAACTTG CGACAACGCC AAGAGGCAAC CGGTGCTCCC
TATATCGAAT GGGCGCGCAA AGGGTTGATA ACGGCCACGC CCGGCAACGT GATCGACTTC
CGGGTTGTCG AGGATCGCAT CAGGGAGCTT TGCGAGACCT ACAGCGTGCA GGAGATCGCG
TGCGATCCTG CGATGGCGCG TAATCTGCTC AACAATCTCA TAGAGGATGG TCTCCCTGCG
ATCGAACATC GCCAGGGCAG CTTGAGCATG ATGCCGGCAA TCGCCGAATT GCAGCGCGCG
ATTATCGGTC GGAAGTTCAA GCACGGTGGT CATCCGGTGC TGCGATTCTG CTTCGCCAAT
GTCGAAGCGG AGACGAATGC AGCCGGACAT ATCGTGAGAT TCACGAAACA GAAGAAATGG
CTATCGATCG ATGGCGCGCA GGCCAGCGCC ATGAGCGTCA ACAGAGCATC CGCAGGCGGC
AGCGCGGCGA CAACATCGCT TTATGATGAC CCGGAATGGG AAACAGCTTT GAAGGGATTC
AATGCATGA
 
Protein sequence
MTAKSTFPEW IYDGSEIPDP FGYGERAVTF LRRLKHPKST LPGRAFQLDP WQERIVRRIY 
GPRHDDGRRI VNTVTMLLPR GNRKTSLGAA LALLHTIGPE RMPGSEVIFS ASDRKQSGIA
FKEARGIVQA DKRLVKATKV YDAFNSAKKI AYPKDSVELE IISADAPSSE GRTPAFVLAD
ETHIWRGKDL WTVLTNGLDK IDNSLLVVTT TAGRGTDNIG YEIIDRARKI ARGEIVDPTV
LPVLFEAEPD CDYTSEEVWR RVNPGSPHGY PSIEGFRRHV KRAQDNPTER SSLKRYKLNI
WEDSSSSPFV DMLVYDEGAG EIDTAILDGE PCWLGVDLSS SIDLSVVIAC FRDGDDYIVQ
PHFFCPQDNL RQRQEATGAP YIEWARKGLI TATPGNVIDF RVVEDRIREL CETYSVQEIA
CDPAMARNLL NNLIEDGLPA IEHRQGSLSM MPAIAELQRA IIGRKFKHGG HPVLRFCFAN
VEAETNAAGH IVRFTKQKKW LSIDGAQASA MSVNRASAGG SAATTSLYDD PEWETALKGF
NA