Gene Nham_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1939 
Symbol 
ID4030542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp2156321 
End bp2157445 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content62% 
IMG OID637970402 
Producttransposase IS116/IS110/IS902 
Protein accessionYP_577204 
Protein GI92117475 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.143032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTCCTGC CCTGCCAGCA AGCGAAGCTG GCAAGGGGCG CCAAAGACAA GGAGCACGCC 
ATGTCTCAGA CACCCAATAC CGCGATCGCC GTGATCGGCA TCGATATCGG CAAGAACTCG
TTCCACGTCG TGGGCCACGA TGCGCGCGGC GCCATCGTGC TGCGGCAAAA GTGGTCGCGT
GGCCAAGTGG AAGCGCGGCT CACCAATATA CCGCCTTGCC TGATCGGCAT GGAAGCCTGC
GTCGGCGCAC ATCACCTGAG CCGCAACCTC GCATCGCTTG GTCACGATGC CAGGTTGATG
CCGGCCAAAT ATGTCCGCCC CTATAGCAAG GGACAGAAGA ACGACTTCAA TGATGCCGAA
GCGATTGCCG AAGCCGTGCA GCGCCCGACG ATGAAGTTCG TGGCGACCAA GACCGCGGAA
CAACTGGATC TGCAGGCGCT GCATCGGGTG CGCGAGCGGC TGGTGTCGCA ACGCACCGGC
CTCATCAACC AGATTCGCGC CTTCATGCTG GAACGCGGAA TCGCCGTGCG CCAGGGTATC
GGCTTCCTGC GCACGGAACT GCCCACCATC CTTGCGACGC GCACTGATGC CCTGTCGCCA
CGCATGTTGC GTGTCATCGA GGAGTTGGCA GGCGACTGGC GTCGGCTGGA TCAGCGCATC
GATGGCCTAT CCGGCGAGAT CGAAGCACTG GCCCGTCAAG ATCAGGCATG TTCGCGCCTG
ATGACGGTGC CTGGCATCGG ACCGATCATT TCGAGCGCCA TGGTGGCCGC GATCGGCACT
GGAGACGTAT TCTCCAAAGG CCGCGACTTC GGCGCCTGGC TCGGACTGGT GCCCAAGCAG
ATTTCGACGG GAGACCGCAC GATCCTCGGC CAAATCTCGA GGCGCGGCAA TCGCTACCTG
CGCGTTCTAT TTGTGCAGGC GGCATGGGTT GTGCTGGTCA GGATAAAGAA CTGGGAACGT
TACGGGCTCA AATCCTGGAT CGAAGCTGCC AAGAGGCGGT TGCACCACAA CGTGCTGGCG
ATCGCGCTCG CCAACAAGCT TGCCCGCATC GCCTGGGCGG TGCTGGCTAA AGGACGCGCC
TTCGAGTTGA CGAGGACCGA CGATGCAGGC GTCCGACCCG CTTGA
 
Protein sequence
MLLPCQQAKL ARGAKDKEHA MSQTPNTAIA VIGIDIGKNS FHVVGHDARG AIVLRQKWSR 
GQVEARLTNI PPCLIGMEAC VGAHHLSRNL ASLGHDARLM PAKYVRPYSK GQKNDFNDAE
AIAEAVQRPT MKFVATKTAE QLDLQALHRV RERLVSQRTG LINQIRAFML ERGIAVRQGI
GFLRTELPTI LATRTDALSP RMLRVIEELA GDWRRLDQRI DGLSGEIEAL ARQDQACSRL
MTVPGIGPII SSAMVAAIGT GDVFSKGRDF GAWLGLVPKQ ISTGDRTILG QISRRGNRYL
RVLFVQAAWV VLVRIKNWER YGLKSWIEAA KRRLHHNVLA IALANKLARI AWAVLAKGRA
FELTRTDDAG VRPA