Gene Nham_2078 details

Gene Information

Locus tag: Nham_2078
Symbol:
ID: 4031252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 2311992
End bp: 2317208
Gene Length: 5217 bp
Protein Length: 1738 aa
Translation table: 11
GC content: 65%
IMG OID: 637970535
Product: alpha-2-macroglobulin-like
Protein accession: YP_577335
Protein GI: 92117606
COG category: [R] General function prediction only
COG ID: [COG2373] Large extracellular alpha-helical protein
TIGRFAM ID:
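
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2311992, 2317208)
print(length)                     # 5217
print(protein_length_aa(length))  # 1738
```

Both values match the Gene Length and Protein Length fields in the record.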


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGGA TGGTTCGTAC CGTAGTCCTC TGCGTGGCGC TGGCGATTGG GTTTGTTTCG 
GCGCAGGCGG CGGATAAGGT TTTCAAGCGC GACGATCTCG CCGACGCGGC GATCAAGCTC
GAAGCCCAGA TCAAGAGCGA AGCCGGCACC ATCGCCAAAT CGGCCCCGAC GCTGCGGACC
GATGCCGACG CCGCCTTCAA GCGATCCGAT TTTCGCAGCG GCCTTCATAT CCTCGGCCAG
ATCGCGACCG TCGCTCCCGA CGACAGCGGC AACTGGCTGC GGCTGGCGCG GACGATTTTC
CAGATCAAGC CGACCACGAG CCAGGAACAG ACCTTTCTGC GGGAGCGGGC CTCCACCGCC
GCCTATATCG CCTATCAGCG GGCGCGTGAT CCCGGTGCCG AAGCTGATGC ATTGGCGATG
CTCGGGCGCG CTTTCTCGGA ACGCAAGCTG TGGCGGCCGG CGCTCGATAC GCTGGGTCTG
TCGCTCGATC TGCGCGAGGT CGCCGACGTT CGCGAGCAGT ACGAAAAGAT GCGCGACGAC
CACGGCTTCC GGCTGCTCGA TTACACCGTC GATTCGGATT CGGCCTCGCC GCGAGTCTGC
TTTCAGTTCT CGGAGGACCT TGCCAAGCGG ACCGACTTTT CACCGTTCGT AGCGCTAGCC
GGCGCCGACC GGCCGGCGCT GTCGTCGGAA GACAAACAGC TTTGCGTCGA CGGCCTCAAG
CACGGCGAGC GCTACAATAT CAACCTGCGC GCCGGCCTGC CGTCGACCGT CAAGGAAAGC
CTGCCGAAAT CCGCTGAATT CAATGTCTAT GTGCGGGATC GCAAGCCGTT CGTCCGCTTC
ACCGGGCGTG CTTACGTGCT GCCGCGCACC GGCCAGCGCG GCATTCCGCT GGTCAGCGTG
AATACCCAGG CGGTGTCGGC GCAGGTATTC CGGATCGGCG ACCGCAATCT CATCAACACC
GTGATCGACA GCGACTTCCA GAAGACGCTC AGCGGCTATC AATTGTCCGA TATCGGTAAC
GAGCGCGGGG TCAAGGTCTG GTCGGGCGAG GTTACGACGG CCTCCACGCT GAACGCCGAT
GTGACCACCG CGCTTCCTGT GGATGAGGCG CTCGGGAACC TCCAGCCCGG CGTCTATGTG
ATGACGGCCG CGCCCAAGGG TCCCGGATCG GCGAACGACG AGGAGTCCGG CTCGCTGGCG
ACCCAGTGGT TCATCGTCTC CGATCTGGGC CTGACCGCTT ATTCCGGCAA TGACGGCATC
CATGTCTTCG TCAATTCGCT GGCGACGACC GATGCGGTGG GCAAGGCGGA AGTCCGCCTG
ATCGCGCGCA ACAACGAAAT CCTTGCGACC AGAAAGACCG ACGAGTCCGG TCATGCGCTG
TTCGAGCCGG GTCTCGCGGG AGGCGAGGGC GGACTGTCGC CGGCGTTGCT GACGGTCAGC
ACCGATAAGG CCGATTATGC GTTCCTGAGT TTGAAGTCGA GCGCGTTCGA TCTCACCGAT
CGTGGTGTGT CGGGTCGGGC AGTGCCTGCC GGGGCCGATG CGTTCGTTTA CGCCGAGCGT
GGCGTCTATC GCTCCGGGGA GACTGTCTAT CTCACGGCGC TGCTGCGCGA CGGGCAGGGC
AATGCCGTGA CCGGCGGGCC GTTGACGCTG GTGGTGGAAC GGCCCGATGG GGTCGAATTC
CGCCGTGCCG TCCTGGCCGA TCAGGGCGCG GGCGGGCGAA GCCTGACCCT GCCGCTCAAT
TCGGCGGTCC CGGCCGGGAC GTGGCGGGTT CGCGCGTTTA CCGACCCAAA GGGGCCGTCG
GTCGGCGAAA CCACCTTCAT GGTCGAAGAC TATGTGCCGG ACAGGATAGA ATTCGACTTA
ATGACCATGG CGAAGCAGAT CGATGCTCAA GCTCCGGTAG AACTTAAGGT TGACGGCCAT
TTCCTCTACG GTGCGCCGGC ATCCGGCCTG CAACTCGAAG GCGACATGCT GGTGGCTCCT
GCAGCCAGCC GTCCGGGTTA TGCCGGATAT CAGTTCGGGG TCGCCGACGA CGAGACCACC
AGCAACGAGC GCACGCCGAT CGAAAACCTG CCCGAAGCCG ACGCCAACGG TGTTGCCACG
TTTCCGGTCA GCCTCGCGAC GCCGCCGTCA TCGTCCCGCC CGCAGGAAGC GCAGATCTTC
GTTCGCATGG CCGAGGCTGG CGGACGCGCG GTTGAGCGGA AGCTGGTGCT CCCGGTCGCC
CCCGCCGCCG CCATGATCGG CGTCAAGCCG CTATTTGCCG ACAAGAACGT TGCGGAAGGC
GACAAGGCCG GTTTCGACGT TGCCTTCGTC GCCCCTGACG GTACCTCGCT TGCGCGCGAG
GGCCTGCGCT ACGAACTGCT CAAGCTCGAG AGCCGCTATC AGTGGTATCG CCAGAATTCG
TACTGGGAAT ACGAGCCGGT GAAGTCGACC AAACGGGTTG CCGACGGCGA TCTCACGATT
GCCGCCAACA AGCCGGCGCG GATCGAACTC TCTCCCCAGC CGGGACGCTA CCGGCTCGAT
GTCAAATCGT CCGATCCCAA CGGTCCGCTG ACCTCGGTGC AGTTCGACGT CGGCTGGTAT
TCCGACGGCA GCGCCGATAC GCCCGACCTG CTGGAAACCT CGATCGACAA ACAGGATTAC
CTGTCCGGCG ACACCATGAT CGTTTCGGTC AATGCCCGAG CCGCCGGCAA ACTCACGATC
AACGTGCTCG GCGACCGGCT GCTGACGACG CAGACCACCG AGGTCAAGGA GGGCACGTCG
CAGGTCAAAA TCCAGGTCGG AAAGGATTGG GGCACCGGCG CCTATGTGGT GGCGACGCTG
CGTCGGCCGC TCGACGTTGC CGCCCAGCGT ATGCCCGGCC GCGCAATCGG CATCAAATGG
TTCGGCATCG ACAAGACGGC GCGTACGCTG TCGGTCAACC TGTCGCCGCC GACATTGGTG
CGGCCGTCGA CGACGTTGAA GCTGCCTGTG AAGATCGGCG GTCTAAGCCC CGGCGAAGAC
GCCAAGATCG TCGTTGCCGC CGTCGATGTC GGCATTCTCA ACCTCACCAA TTACAAACCA
CCCGCACCCG ACGACTACTA TCTCGGCCAG CGCCGCATGA CCTCGGAAAT CCGTGATCTT
TACGGGCAAC TGATCGATGG CATGCAGGGC ACGCGCGGCC AACTCAGGAC CGGCGGCGAT
TCGGCAGGAG CGGAGTTGCA GGGCAGCCCG CCGACGCAGA AGCCGCTCGC GCTCTATTCC
GGCATCGTCA CGGTGGCCGC GGACGGCACG GCCGAGATCA GCTTTGACAT TCCGGAATTC
GCCGGCACCG CACGGGTGAT GGCGGTTGCC TGGAGCGCAA CCAAGCTTGG CCGCGCGACG
GTCGATGTCA CCGTGCGCGA TCCGGTGGTG CTGACGGCGA CCCTGCCGCG CTTCCTGCTG
ACCGGCGACC AGGGCACCAT GAGCTTCGAT CTCGACAACG TCGAGGGTGC GCCCGGCGAC
TACACCGTCA ACGTCAAGGC CTCGGGGCCG GTGACGGTGG CGGGCAATCC CGCGACCACG
ATCACGCTCG CGGCCAAGCA GCGCAGCTCG ATGGCGCTGA CGCTCAATGC TGGCGGCGCC
GCCGGCGCCG CGCAATTCGA CGTCGATATC AAGGGGCCGA ACGGTCTGAC GCTGGCGCGG
CACTATGATC TCGACGTCAA GCCCGCGACC CAGATACTGG CCCGACGCTC GGTCCGCACG
CTGGCAAAAG GCGAGAGCCT GACGCTGACC TCGGACATGT TCTCCGACCT CGTGGCGGGA
ACGGGCGGGG TGTCGATGTC GGTCGGCCTG TCGTCCGCGC TGGATGCGGC GACCGTCCTG
AAGGCACTCG ACCGTTATCC CTTCGGCTGT TCCGAGCAGA TCACCAGCCG GGCGATGCCG
CTACTCTATG TCAACGATCT CGCGGCCGGA GCACACCTCG CGATGGATAC CGGCATCGAT
GAGCGCATCA AAAGCTCGAT CGACCGTCTG CTGGCCCGGC AAGGCTCGAA CGGCTCGTTC
GGCATGTGGT CATCCGGCGG CGACGACGCG TGGCTCGACA CCTATGTCAC CGACTTCCTG
ACCCGCGCCC GCGAGAAGGG ATTTGCGGTA CCAGAAACCC TGTTCAAGAG CGCGCTCGAT
CGCATCCGCA ACTCGGTTGT GAATGCCTCC GAACCAGAGA AGGATGGCGG CCGCGAACTG
GCTTACGGGC TTTATGTTCT CGCCAGGAAC GGTGCCGCTC CGATCGGGGA TCTGCGCTAT
CTGGCCGATA CCAAGCTGAA CAATCTGGCC ACCCCGATCG CCAAGGCGCA ACTGGCCGCG
GCGCTAGCGC TGGTCGGCGA CCGGGCGCGT GCAGAACGCG TGTATGCGGC GGCGGCTGAA
AGTCTGGCGC CGAAACCGGT CGTCGAATTC GGCCGGACGG ACTACGGCTC GGCCCTGCGC
GATGCCGCGG CGCTGGTGTC GCTGGCGAGC GAGGGCAACG CGCCGCGAGC GACGGTGACG
CAGGCGGTGC AGCGGGTCGA GGCGGCGCGG GGACTGACGC CCTTTACCTC GACGCAGGAG
AACGCGTGGC TGGTTTTGGC GGCGAGGGCG CTTGCGAAAG AATCCATGTC GCTCGATGTC
GATGGCACAC CGGTCAAAAC CGCGCTCTAT CGCAGCTACA AGGCAGCCGA GATGGCTGAC
AAGCCGATCA AGATCGCCAA TACCGGCGAT GCGCCGGTGC AGGCGGTGAT TTCCGTGGCC
GGCGCGCCGA TGACCCCGGA GCCGGCTGTC TCGAACGGTT TCGAGATCGA ACGAAACTAT
TTCACGCTCG ACGGGACGCC GGCGGATCCG ACACAGGCCA GGCAGAACGA CAGGTTGGCC
GTCGTTCTCA GGATCACCGA GGCCAAGCCG GAATACGGTC ACATCATGGT GGCCGACTAT
CTTCCGGCAG GTTTCGAAAT CGACAATCCG CACCTGGTCT CGTCCGGCGA CGCCGGCACC
CTCGACTGGA TCGAGGAGGG CCAGCAGCCG GTCAATACGG AATTCCGCGA CGATCGGTTC
ACTGCGGCCT TCGATCGCGC CGGTAACGAC AAGGCGGTGT TCACCGTCGC CTATGTCGTG
CGGGCGGTTT CGCCGGGCAA GTATGTTTTG CCGCAAGCCT ATGTCGAGGA CATGTACAAT
CCCTCGCGCT ACGGCCGCAC CGGCACCGGT CGCGTCGAGG TGCGTCCCGC GAAATGA
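
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how the listing could be cleaned and how a GC percentage and a basic start/stop codon check could be derived from it. The helper names are hypothetical, and the stop codons are those shared by the standard code and translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(blocked: str) -> str:
    """Join a blocked listing into one uppercase string without whitespace."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, starts with ATG, ends with a stop.

    Table 11 also permits alternative start codons; only ATG is checked here.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input for illustration; paste the full listing above to check the record.
toy = clean_sequence("ATGAAAGGCC\nTGA")
print(gc_percent("ATGCGGCC"))        # 75.0
print(looks_like_complete_cds(toy))  # False: 13 bases is not a multiple of 3
```

Applied to the full listing, the cleaned length would be expected to equal the 5217 bp given in the Gene Length field.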
 
Protein sequence
MTGMVRTVVL CVALAIGFVS AQAADKVFKR DDLADAAIKL EAQIKSEAGT IAKSAPTLRT 
DADAAFKRSD FRSGLHILGQ IATVAPDDSG NWLRLARTIF QIKPTTSQEQ TFLRERASTA
AYIAYQRARD PGAEADALAM LGRAFSERKL WRPALDTLGL SLDLREVADV REQYEKMRDD
HGFRLLDYTV DSDSASPRVC FQFSEDLAKR TDFSPFVALA GADRPALSSE DKQLCVDGLK
HGERYNINLR AGLPSTVKES LPKSAEFNVY VRDRKPFVRF TGRAYVLPRT GQRGIPLVSV
NTQAVSAQVF RIGDRNLINT VIDSDFQKTL SGYQLSDIGN ERGVKVWSGE VTTASTLNAD
VTTALPVDEA LGNLQPGVYV MTAAPKGPGS ANDEESGSLA TQWFIVSDLG LTAYSGNDGI
HVFVNSLATT DAVGKAEVRL IARNNEILAT RKTDESGHAL FEPGLAGGEG GLSPALLTVS
TDKADYAFLS LKSSAFDLTD RGVSGRAVPA GADAFVYAER GVYRSGETVY LTALLRDGQG
NAVTGGPLTL VVERPDGVEF RRAVLADQGA GGRSLTLPLN SAVPAGTWRV RAFTDPKGPS
VGETTFMVED YVPDRIEFDL MTMAKQIDAQ APVELKVDGH FLYGAPASGL QLEGDMLVAP
AASRPGYAGY QFGVADDETT SNERTPIENL PEADANGVAT FPVSLATPPS SSRPQEAQIF
VRMAEAGGRA VERKLVLPVA PAAAMIGVKP LFADKNVAEG DKAGFDVAFV APDGTSLARE
GLRYELLKLE SRYQWYRQNS YWEYEPVKST KRVADGDLTI AANKPARIEL SPQPGRYRLD
VKSSDPNGPL TSVQFDVGWY SDGSADTPDL LETSIDKQDY LSGDTMIVSV NARAAGKLTI
NVLGDRLLTT QTTEVKEGTS QVKIQVGKDW GTGAYVVATL RRPLDVAAQR MPGRAIGIKW
FGIDKTARTL SVNLSPPTLV RPSTTLKLPV KIGGLSPGED AKIVVAAVDV GILNLTNYKP
PAPDDYYLGQ RRMTSEIRDL YGQLIDGMQG TRGQLRTGGD SAGAELQGSP PTQKPLALYS
GIVTVAADGT AEISFDIPEF AGTARVMAVA WSATKLGRAT VDVTVRDPVV LTATLPRFLL
TGDQGTMSFD LDNVEGAPGD YTVNVKASGP VTVAGNPATT ITLAAKQRSS MALTLNAGGA
AGAAQFDVDI KGPNGLTLAR HYDLDVKPAT QILARRSVRT LAKGESLTLT SDMFSDLVAG
TGGVSMSVGL SSALDAATVL KALDRYPFGC SEQITSRAMP LLYVNDLAAG AHLAMDTGID
ERIKSSIDRL LARQGSNGSF GMWSSGGDDA WLDTYVTDFL TRAREKGFAV PETLFKSALD
RIRNSVVNAS EPEKDGGREL AYGLYVLARN GAAPIGDLRY LADTKLNNLA TPIAKAQLAA
ALALVGDRAR AERVYAAAAE SLAPKPVVEF GRTDYGSALR DAAALVSLAS EGNAPRATVT
QAVQRVEAAR GLTPFTSTQE NAWLVLAARA LAKESMSLDV DGTPVKTALY RSYKAAEMAD
KPIKIANTGD APVQAVISVA GAPMTPEPAV SNGFEIERNY FTLDGTPADP TQARQNDRLA
VVLRITEAKP EYGHIMVADY LPAGFEIDNP HLVSSGDAGT LDWIEEGQQP VNTEFRDDRF
TAAFDRAGND KAVFTVAYVV RAVSPGKYVL PQAYVEDMYN PSRYGRTGTG RVEVRPAK