Gene Nham_1415 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1415 
Symbol 
ID4032333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp1599481 
End bp1600734 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content66% 
IMG OID637969889 
Productphage major capsid protein, HK97 
Protein accessionYP_576697 
Protein GI92116968 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTCG ATATCACTGA CACCGCGCCG GAGCACAAGT CCGGCATCGC CGCGCGCGGC 
GACTACGACG ACTTCCGCAC GACCTTCGAG GAGTTCAAAT CCGCTAACGA CGAACGTCTG
GCGCGGCTGG AGAAAAAGCG CGGCGACGTG CTGCTGGAGG AAAAGGTCGA CCGCATCAAT
GCCGCGCTCG ACGCCCAGCA CAAGCGCATG GATGAGCTGG CTCTGAAGCA CGCGCGCCCG
GCGCTCGAAG GCCGCAGCCG CATAGCCAGC GACGCGGCGT CCCGCGAGCA CAAGAGCGCG
TTCGAGGCTT ATGTGCGCGG TGGCGAGGCG GGTGCCTTGC GCGACCTGGA GACCAAGGCG
ATGTCGGCCG GCTCCAATGC GGATGGCGGC TATCTCGTTC CGGTCGAACT CGAACACGAG
ATCGGCGAGC GACTGGCGGC AATCTCGCCG ATCCGCGCGC TCGCGTCGGT GCGCACCATC
TCGGGCAATG TCTACAAGAA GCCGTTCATG ACCGCAGGGC CGGCCACCGG CTGGGTCGGC
GAGACGGACT CGCGGACGCA GACCACCTCG CCGACGCTGG ACGCGCTGAG CTTCCCGGCG
ATGGAGCTTT ACGCCATGCC AGCGGCGACC GCGACGCTGC TCGACGATAG CGCCGTCAAC
ATCGACGAGT GGATCGCGCA GGAGGTGGAG CTGACGTTTG CGGTGCAGGA AGGCGCGGCC
TTCGTCAACG GCGACGGCAC CAACCAGCCG AAGGGTTTTC TGCAATCGGA TACGGTGGCG
AACGGCTCGT GGGTGTGGGG CAAGCTCGGC ACTATCGCCA GCGGCGGCGC GAGCGGTTTC
GCGGCGTCGA ATCCGTCCGA TGCGCTGGTG GACCTGATCT ACGCGCTGAA GGCCGGCTAT
CGCCAGAACG CCACCTTCGT GATGAACCGC AAGACGCAAG CCGCGATCCG TAAGTTCAAG
GACACCGGCG GGGCGTATCT GTGGCAGCCG CCGGCGCAGG CGGGCGGGCG CGCCTCGCTG
ATGACGTTCC CGCTGGTCGA GGCCGAGGAC ATGCCGGACG TCGCGGCGAA TTCGCTGTCG
ATTGCGTTCG GCGATTTCCG CCGCGGTTAC CTCGTGGTGG ATCGCGCCGG CGTGCGCGTG
CTGCGCGATC CGTACTCGGC CAAGCCTTAC GTGCTGTTCT ACACGACGAA GCGCGTCGGC
GGCGGCGTGC AGGACTTCGA CGCCATCAAG TTGATGAAGT TCGCAGCGAG TTGA
 
Protein sequence
MDFDITDTAP EHKSGIAARG DYDDFRTTFE EFKSANDERL ARLEKKRGDV LLEEKVDRIN 
AALDAQHKRM DELALKHARP ALEGRSRIAS DAASREHKSA FEAYVRGGEA GALRDLETKA
MSAGSNADGG YLVPVELEHE IGERLAAISP IRALASVRTI SGNVYKKPFM TAGPATGWVG
ETDSRTQTTS PTLDALSFPA MELYAMPAAT ATLLDDSAVN IDEWIAQEVE LTFAVQEGAA
FVNGDGTNQP KGFLQSDTVA NGSWVWGKLG TIASGGASGF AASNPSDALV DLIYALKAGY
RQNATFVMNR KTQAAIRKFK DTGGAYLWQP PAQAGGRASL MTFPLVEAED MPDVAANSLS
IAFGDFRRGY LVVDRAGVRV LRDPYSAKPY VLFYTTKRVG GGVQDFDAIK LMKFAAS