Gene Nham_0248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_0248 
Symbol 
ID4030996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp275056 
End bp278316 
Gene Length3261 bp 
Protein Length1086 aa 
Translation table11 
GC content69% 
IMG OID637968784 
ProductSel1-like 
Protein accessionYP_575608 
Protein GI92115879 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCGC GCGGATCGTG GAGCGAAGAC GGGATCGAAC CATCCGTTCG CGAGCGGGCC 
GAAGCCGCTG CGCGGCGCGC CGGCGTGTCC CTGAACGAGT GGCTCAGTTC CACGGTCGGC
GGCGCCATGC CGGATTCCCG GTTGGCGCAG TCCCCGGTCC CGAGCCAAGA CAGCCACCAG
GACAACCACG ACGTCGCGGA CATCCATCAG CGGCTCGATT CCATCACCCG CCAGATCGAT
CAGTTGTCGA GTCCGGCCAC GCGCGGCGAA CCCGCCGTCG CCCGCCAGCT CAACGACGCC
ATTTCGCGGC TCGATGCCCG GCTCGCCCGG GTCTCCGCCC AGGCACCCGC AGATGACTCG
CAGCACCGCG CCGACAGGGT CGAGCGCGCC GCAGCCGAGG TCTACAGCCG GTCGCCGAGG
CCCGATGTCG CCTCGCTCGA ATTCGCGATC GCGGAAGTCG CCGCGCGCCA GCATGAGCTC
GACGGCGCAG GCGCGATGCC GCCGCGCAGT TCACCGCCGG TCGTGCCGGC GATGACAACG
TCGCGGCCGG CGCCGGATTT TTCCGGCCTC GAACAGCAAC TCTTCAAGAT CACGAGCCAG
ATCGAATCGC TGCAGCGCCC GGACGGGATC GAGCAGTCGA TCGCCGCATT CCGCAGCGAG
CTTGCCGGCA TTCGTCACGT CATTACCGAG GCGTTGCCAC GCCGCGCGAT CGAATCGATC
GAAAACGAGA TTCGCTCGCT ATCGCAGCGC ATCGACGACG TCCGCCAAAA CGGCAGCGAT
GGCCAGGCGC TGGCGAATAT CGAACGCGCC CTCAATGAGA TCTACGACGC GCTGCGATCG
CTGAAACCGG CGGAACAGCT TGCAGGATTC GACGAGGCGA TCCGCAATCT CGGCAACAAG
ATCGATACGA TCGTGCGCAG CAGCGGCGAC AGCGGCATGA TGCAGCAGCT CGAGAACGCG
ATCGGCGCGC TTCGCGGCAT CGTCTCCAAC GTCGCCTCCA ACGATGCGCT GGCGCGGCTC
AGCGACGATC TCACCCTGCT GTCGTCGAAA GTCGATCAGC TCGGTCGATC CGAGGGCAAC
AGCGATTCGT TCGCCGCGCT CGAACAGCGC GTCGTCGCCG CGCTGACGGC GACCCTGGAA
AACCGCGAAC GTCCCGCCTC CGGCGGCAGT TCCGAGCAGC TTGAAGAGGC GGTGCAGGCG
CTGTCCGACC GTCTCGACAG TTTGCCGGCC GGCCACGACT CGTCATCGGC GCTGGCGCAT
CTCGAACAAC GGGTCTCGCT GCTGCTCGAA CGCCTGGAGA CCGCCGGCGG TCATTCGGGA
ACCAATCTCG CAGGAACCAA CCTCGGGCGC GTCGAGGAAG GCTTGCAGGA CATCCTGCGC
CACCTGGAAC GGCAGCAGGC CGGGCTCGCG GCGCTGACCG AAAGCGGCCC GCGCAGCACC
GGCCCGACCA TGGACAGCGA GGTCGTCGAG GCGATCAAGC GCGAACTGTC CGAGATGCGA
TTCTGCCAGT CGGAAACCGA CCGCCATACT CAGGATTCAC TCGAGGCCGT TCACAACACG
CTCGGGCATG TGGTCGATCG ACTGGCGATG ATCGAAGGCG ACCTGCGCGC GGTCCGTGCG
ATGCCTGCGG CCCAGGCCGG GCCAGCTCGC GGCGCGATAC CGGAACCGCC GGCGGGTCTG
CCGCCAAGGC CCGAATTGCC GAACCCCGTG CTGTCGCAGA TGGCGGCGCC GCAGCCCGCT
GCGGCAGCGT CGGCATCAGC GTCGATCCCG CCGCGCGCGA TCGGCGACAT CCTGATTTCG
AGAGACACCT TCGACCCGGG GCACGGCCCG CAATCCGCAA CCGTGCGCCC GCCGCAACCG
CGCCCCGCGA TCGATCCGGA CCTGCCGCCC GATCATCCGC TGGAACCCGG CACCCGTCCG
GAGGGACGCC CGGCGTCGCC GTCGGAGCGT ATCGCTGCCT CCGAAAGCGC GATCGGCGAC
ATCGCCGAAC CGCCGCGCGA GCAATCGGGT TCATCATTCA TTGCTGCGGC GCGCCGGGCC
GCGCAGGCGG CGGCCGCCGC GGCGCCGTCG TCCGACAAGG CTGGCCGGAC CAAGGTCGCC
ATCGAACCGG CGCGGCCCGC CACAGGCGGG TCCAGTATCA CCTCCAAGAT CCGCTCGCTG
CTGGTCGGAG CGAGCGTGGT GGCGATCGTG CTCGGCAGTT TCCAGCTCGC AATGTCGCTG
TTCGATGGAA CCTCGCGCAC GGCCGTCAGC GAGACCAGCC CGGTCGCCGC GACCGCGGCA
AAGCCGCCGG CCGACGCCGA AGCCGCACCC GCACCCGCGA CCGCGATTCC GGCGATGCCG
TCGATGACCT CGCCGACGCC GATCGACCGC CAGTCGATGA TTTCGCCGCC GTCCGGCGGC
AGCGTGGCTC CGGCCGAATC CCCATCGGAC ACGGCGCCAT TGAGCCGGCC GGAGACCGCA
TCCACTGAGG TCACCGGCAC GATCCCGGTC GCGCCCACAT CCGTCCCGCT CGAGCGGGTC
GCAATTCCCC GTTCGGAATC CCTTCCCGAC AGCATCGGCG GCCCAATGTT GCGGACCGCC
GCGCTCCACG GCGACGCCGC GGCGGCGTTT GAAGTCGGCG TCCGCTATGC CGAAGGCAAA
GGCGTTGCCG TCAATTACGA CGAAGCCGCG AAATGGTACG ATCGCGCGGC GCAGGCCGGC
GTCGTGCCCG CCATGTTCAG GCTCGGCACC CTGCACGAAA AGGGCCTCGG CGCGAGCAAG
GACGTCGATG CGGCGCGGCG CTATTACATG CAGGCGGCCG AGCGGGGCAA TGCCAAGGCG
ATGCATAACC TCGCCGTGCT CGACGCCGAC GGCGGCGGCA AGGGCGCCAA CTACACGAGC
GCGGCGCAAT GGTTCCGCAA GGCGGCCGAG CGGGGCGTCG CCGACAGCCA GTTCAATCTC
GGCATCCTCT ATGCCCGCGG CATCGGCATC GAACAGAACC TGGCGGAATC CTACAAGTGG
TTCAGCCTCG CCGCCGCCCA GGGCGACGCG GATTCCGCGC GCAAGCGCGA CGAGGTCGCC
AAGCGCCTCG ATCCGCAGTC GCTTGCCGCG GCCAGGCTGG CGGTCCAGAC GTTCACCCCC
AAGCCCCAGC CCGACGATGC CGTCAAGGTC GTAAGCCCCG CCGGCGGATG GGACGGCGCA
GCCGCCATGC CCCAGGCCAA AACCAAGACG GCCCGGTCCG CGACGGCCAA ATCTGCGATG
GCCAAGCAGG CCGTGCGTTA A
 
Protein sequence
MNSRGSWSED GIEPSVRERA EAAARRAGVS LNEWLSSTVG GAMPDSRLAQ SPVPSQDSHQ 
DNHDVADIHQ RLDSITRQID QLSSPATRGE PAVARQLNDA ISRLDARLAR VSAQAPADDS
QHRADRVERA AAEVYSRSPR PDVASLEFAI AEVAARQHEL DGAGAMPPRS SPPVVPAMTT
SRPAPDFSGL EQQLFKITSQ IESLQRPDGI EQSIAAFRSE LAGIRHVITE ALPRRAIESI
ENEIRSLSQR IDDVRQNGSD GQALANIERA LNEIYDALRS LKPAEQLAGF DEAIRNLGNK
IDTIVRSSGD SGMMQQLENA IGALRGIVSN VASNDALARL SDDLTLLSSK VDQLGRSEGN
SDSFAALEQR VVAALTATLE NRERPASGGS SEQLEEAVQA LSDRLDSLPA GHDSSSALAH
LEQRVSLLLE RLETAGGHSG TNLAGTNLGR VEEGLQDILR HLERQQAGLA ALTESGPRST
GPTMDSEVVE AIKRELSEMR FCQSETDRHT QDSLEAVHNT LGHVVDRLAM IEGDLRAVRA
MPAAQAGPAR GAIPEPPAGL PPRPELPNPV LSQMAAPQPA AAASASASIP PRAIGDILIS
RDTFDPGHGP QSATVRPPQP RPAIDPDLPP DHPLEPGTRP EGRPASPSER IAASESAIGD
IAEPPREQSG SSFIAAARRA AQAAAAAAPS SDKAGRTKVA IEPARPATGG SSITSKIRSL
LVGASVVAIV LGSFQLAMSL FDGTSRTAVS ETSPVAATAA KPPADAEAAP APATAIPAMP
SMTSPTPIDR QSMISPPSGG SVAPAESPSD TAPLSRPETA STEVTGTIPV APTSVPLERV
AIPRSESLPD SIGGPMLRTA ALHGDAAAAF EVGVRYAEGK GVAVNYDEAA KWYDRAAQAG
VVPAMFRLGT LHEKGLGASK DVDAARRYYM QAAERGNAKA MHNLAVLDAD GGGKGANYTS
AAQWFRKAAE RGVADSQFNL GILYARGIGI EQNLAESYKW FSLAAAQGDA DSARKRDEVA
KRLDPQSLAA ARLAVQTFTP KPQPDDAVKV VSPAGGWDGA AAMPQAKTKT ARSATAKSAM
AKQAVR