Gene Nham_3850 details


Gene Information

Locus tag: Nham_3850
Symbol:
ID: 4030185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 4228762
End bp: 4231107
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 59%
IMG OID: 637972239
Product: complement C1q protein
Protein accession: YP_579013
Protein GI: 92119284
COG category:
COG ID:
TIGRFAM ID:
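
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4228762, 4231107)
print(length)                     # 2346
print(protein_length_aa(length))  # 781
```

Both values agree with the Gene Length and Protein Length fields of this record.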


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCCTC TGGCCCTTGT ATTCGTATTA GCCTTCAGTG TTATTGCCAG CGCAGCTCAT 
GCGCAGGACT GGGGCAAGGT GGCGACGGTA TCGGCCACCA TGGGTGTGAG CGGCAGCCGG
TTGTGCCTGG GCGAGGCGTC CCGCGGCGAC ATCGGTTGCC CGGCCTACGC GCCGAGCGTG
ACCACGGCGG GCGACCTGGG CGTCAGCGGC ACGGTGACGG CCAACAAGTT TATCGGCGAC
GGCAGCGGGC TGCTGAACCT GTCGGCCAGC GGCGACCGGA TTGTGAGCGG CACGCTGAGC
ATGCTGGCGA TTTCCAATAC CGGCTACATC AGCCTGACCA CGGGCGCGAC CAACTGGGGC
TATTTGAGCA GCGGCGTGAA CTACCTGCCG AATCTTAAGA CGACGACGCT CTCCGCCACC
ACGGCTATCC AGGTGGGGAG CAACAGCCTC ACCTGCGGCA CCACCATTTC CGGCACCATG
CGCTACAGCG CCATCTCTTC CACCATGGAA TACTGCAACG GCAGCGCCTG GACCAGCATG
GGGCCGAGTG CCACCTCACC AGTAAGTTTC ATGGTGAAGC GGAGCGCCAA TCAAACGGTT
GCTTCCTACG TGGAGACAAA GATTCAGTTC GACCAGGAAG TGTTCGATAC AAATAACAAC
TTCAATACAT CTACCAACCG TTTCACACCG ACCGTGCCCG GAAAATACCT CATCACTCTC
TCCACCTACT GCGCCGATGC GACGACGCAA TGTAATGCCT TGATTTATAA AAATAGCAGT
TCAGCATATG CATCGTTTGC CTACACTGGC ACCACAGCTC CCCAGGCCAC GGCCATCATT
GATATGAATG GAACGACCGA TTACCTGGAA GGGTATGTCT ATAACGGGGG CGGAACCACT
CTAGCTGGCG GAGCGACTGG TAATCACTAC ACGTATTTTG ACGGAGTTCT ACTCGCGCCC
CAAGGTGGCG GCAGCGGCGG CACGGCCACA CCGGCGGGCA GCACCGATGA TGTCCAATAT
GCCAGCGGTG GCGCCTTGGC GGCCGATACG GGCAAGTTCA CCTACGCTGG CGGGCTTCTG
TCGGCGCCGA ACATTTCCAC CACCAACATT TCGCTCAGCA CCATCAACGG CGTTCTCTTT
ACCGGCGGGG CCAGCGGCGA CCGCATCGTG AGCGGCACGC TCAGCATGCT GGCGATTTCC
AGTACCGGCT ACATCAGCCT GACCACGGGC GCGACCAACT GGGGTTACCT GAGCAGCCTT
GCGAGCTTCA TTCCGGTGCT AGGCGCGAAC ACGATCAGCA GCACCAACAT CAGTGTTACG
CTTAGCCACT ATACGCCACG TGCGATTACG TCGTTTGCGG GCGCCGGCGG CAACTACATC
GTCAGTAGCA CGTCGAGCGT GTCGGCCAGC AGCGCGGGCA GCGTGAAAAT CGCCGCCGGT
GGCAAGCTGG CCATGACTAT TGTGTCGAGC GGCAACGTGG GGATTGGGAC AACTGCGCCG
AGCACTCCGC TAACGTTAGG AAACGCAAAG GCGCTGGGTT TTAACAGTAC AACGGGCTAC
AACAGCGGCT CGCTTGGTGC GGCCATATAT AAGTGGACTG ATAACAGTCT TTTTATCGAC
AACTTTGACG GAAGTGTTGT CTTCCGGCGC GCTTCATTTC TGCCCTCGAT GGTTATTGAC
CCTTCTGGCA ACGTGGGGAT TGGGACGACG GCACCAGCAA AAACTTTGGA TGTTAACGGC
GGGGCTAGCA TTGGCACTAC TGCAGGTTCC CGGGTTCAGC TTGGCACTAA CGGCAGCGTC
AATTTCATTC AATCCATCAC AACTGGGAAT GCGGCGTTTC CATTGAGTTT TTACCAAGGG
CCGGGTGAGG CCATGCGCAT TGATACAAAC GGCAACGTGG GGATTGGGAC AACAGCCCCC
TCGTATATGC TGCACGTCAA TGGTTCGGTT GCCGGGGTAG GCGCCTACAA TGCGCTTTCC
GACCGCCGAT TTAAGAAGAA TATCCATCCC GCCGATTACG GGTTGGCAGC GATTGAGAAG
CTGCGCCCGG TCACGTTTGA CTGGATAAGC CCGACCAGCC CGCAATTGCA TAACCGTCAG
TTGGGCCTGA TTGCGCAAGA GGTGCAGCCC CTGGTGCCCG AGGCGGTGAG TGTCGCTAAC
GACCCCTCGC ATACCATGAG TATTGCCTAC AGCACGCTGG TGCCGGTGCT CATCAAGGCC
GTGCAGGAGC TCAAAGCCGA CAACGACAAT TTGCGAGCAG AGCTGCGCAC GGTGAGGGAC
ACCGACCACG CCGCGATTGA ATCTCTCCAG CGTCAGCTCA ACGAGTTGAG GGCCGCGAAA
CGCTAG
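
The gene sequence is displayed in blocks of 10 bases, 60 per row, so the spacing has to be removed before the sequence is used. The sketch below strips that whitespace and computes a GC fraction of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short second input is a toy string, not data from this record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly space-grouped) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence: six blocks of 10 bases.
first_row = "ATGCGCCCTC TGGCCCTTGT ATTCGTATTA GCCTTCAGTG TTATTGCCAG CGCAGCTCAT"
print(len(clean_sequence(first_row)))  # 60

# Toy input to show the calculation: 4 of 8 bases are G or C.
print(gc_fraction("GGCC AATT"))  # 0.5
```

Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.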
 
Protein sequence
MRPLALVFVL AFSVIASAAH AQDWGKVATV SATMGVSGSR LCLGEASRGD IGCPAYAPSV 
TTAGDLGVSG TVTANKFIGD GSGLLNLSAS GDRIVSGTLS MLAISNTGYI SLTTGATNWG
YLSSGVNYLP NLKTTTLSAT TAIQVGSNSL TCGTTISGTM RYSAISSTME YCNGSAWTSM
GPSATSPVSF MVKRSANQTV ASYVETKIQF DQEVFDTNNN FNTSTNRFTP TVPGKYLITL
STYCADATTQ CNALIYKNSS SAYASFAYTG TTAPQATAII DMNGTTDYLE GYVYNGGGTT
LAGGATGNHY TYFDGVLLAP QGGGSGGTAT PAGSTDDVQY ASGGALAADT GKFTYAGGLL
SAPNISTTNI SLSTINGVLF TGGASGDRIV SGTLSMLAIS STGYISLTTG ATNWGYLSSL
ASFIPVLGAN TISSTNISVT LSHYTPRAIT SFAGAGGNYI VSSTSSVSAS SAGSVKIAAG
GKLAMTIVSS GNVGIGTTAP STPLTLGNAK ALGFNSTTGY NSGSLGAAIY KWTDNSLFID
NFDGSVVFRR ASFLPSMVID PSGNVGIGTT APAKTLDVNG GASIGTTAGS RVQLGTNGSV
NFIQSITTGN AAFPLSFYQG PGEAMRIDTN GNVGIGTTAP SYMLHVNGSV AGVGAYNALS
DRRFKKNIHP ADYGLAAIEK LRPVTFDWIS PTSPQLHNRQ LGLIAQEVQP LVPEAVSVAN
DPSHTMSIAY STLVPVLIKA VQELKADNDN LRAELRTVRD TDHAAIESLQ RQLNELRAAK
R
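
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It assumes the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle alternative start codons, which is sufficient here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 15 bases of the gene sequence.
print(translate("ATGCGCCCTC TGGCC"))  # MRPLA
# Last 9 bases of the gene sequence, ending in the TAG stop codon.
print(translate("AAA CGCTAG"))        # KR
```

The two outputs match the first five and last two residues of the protein sequence above.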