Gene GSU0812 details


Gene Information

Locus tag: GSU0812
Symbol:
ID: 2685204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 870871
End bp: 873117
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 63%
IMG OID: 637125484
Product: nitrogen regulation protein NtrY, putative
Protein accession: NP_951869
Protein GI: 39995918
COG category: [T] Signal transduction mechanisms
COG ID: [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
TIGRFAM ID: [TIGR00229] PAS domain S-box
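
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminating stop codon; the record does not state either convention. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 870871
end_bp = 873117
gene_length_bp = 2247
protein_length_aa = 748


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under those assumptions the fields agree: 873117 - 870871 + 1 = 2247 bp, and 2247 / 3 - 1 = 748 aa.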


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.439119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAAC GGACCACGGA CAGCACGCCT ACAGCAACAC CAGCCGAACG GGAACGGATC 
AAGCGGCGCC GCGAAGGGAT CGTCATCGCC CTGTCGGTGC TCCTGATCCT GATCCTGACC
CGGGTGGAGA TCCACCTGTC GCGCATCAGC TCCGATGTGC CGATGGGGAG CAACTTCCTG
ATCTTCGGCG TCATCAACGT CATCATCCTG CTGATCATCC TCCTCATCTA CCTTCTGTTC
CGCAACGTGG CAAAGCTTCT GATGGAGCGA CGGGGCAAGG CCCTGGGGGC CAACCTGCGC
ACGAAACTCG TCATCGCCTT CGTGAGCCTC TCCATCATCC CCACCATGCT CCTGTTCTTT
GTCTCGGCCA CCTACGTGAA CCAGAGCATC CGCAACTGGT TCAACACCCA GATCGAAAAC
TCCCTCTCCG AATCGCTGGA GGTTGCCCAG ACCTACTACA AGAATTCGGC GGCCAACGCT
CTCTACTACG GTAACCAGAT CAGCGCCATC ATCCGGGACC AGCGCCTGCT CAACGACGAG
AACCTGCCGC GGATGAAGGA GCTGATCCGC CAGAAGCAGA AGGAGTACAA CCTGGGAATC
GTGGAGGTCT ACTCGGCCCA GAATGAGGAG CTGGTCCGCG CCGCGAACCC GGGGCTCCCG
CTGGGGGAGT TCACCAACCC TTCCTCCGAA GACATCAAGC AGGGGCTCAA CGGCAAGGAA
CTGACCCGGG TCAACACGGT GGGTAAGGCC GACCTCATCC GCGGCATCGT GCCGATCTAT
TCCACCTACC GCGCCGACGA GGTGGTGGGG GCGGTGGTGG TCAACTACTT CGTCCCCTAC
TCCCTGGTGG AGAAAATGCG GGAGATCACC GCCTCCTACG AGCAGTTCCG TCAGCTCAAA
ATCCTGAAGA ATCCCATCCG GTCCGGCTAC ATCCTGGCCC TGTTCCTGAT CACCATGGTG
ATTATCTTCC TGGCGGTCTG GTTCGGACTC CACCTGGCCA ACAGCCTCAC CACGCCGATC
CAGGAACTGG CCGAAGCGAC CCGTCAGGTG GCCGAGGGAA ACCTGAACAT CCGCCTGGGC
CAGCGGACCG ACGATGAGCT GGGCATGCTG GTGGCCGCCT TCAACAAGAT GACCGACGAC
CTGCGAAGCA ACCAGCTGGC CCTGAAAAAC GCCAACGACG AGTTGTCGCG CAGCAACCAG
GAACTGGAAC AGCGGCGGCG CCATATGGAA ATTGTGCTCC GCAACGTGGC TGCCGGGGTC
ATCGCCGTGG ACCGGGGGGG GACGATAACC ACCATCAACC CCTCAGCGGC CCGGCTCCTG
CAGATCGACA TGCCCCGGGC CGTGGGCCAC AACTTCCGGG AGGTGCTCAA GGGAGAGCAG
CTCGACATCG TGAAGGGGGT CCTGCGGGAC ATGGTCATGG CCAAGAAGGA CAGCATCAGC
CGCCAGATCA CGGTGCCGGT GCGGGACAGC CGGGCGACGT TCCTCTTCAA CCTGTCGGTC
CTGCGGGACG AGAGCGGCGA ATTCCTCGGC ACCGTGGTGG TCTTCGACGA CATGACCCAG
TTGATCAAGG CCCAGCGGAT GGCCGCCTGG CGCGAGGTGG CCCGCCGCAT CGCCCACGAG
ATCAAGAATC CGCTGACCCC GATCCAGCTC TCGGCCCAAC GGCTGCGCAA GCGCTACCTT
CCCCGCTTCG GCGACGAAGA CCGGGTCTTT GACGAGTGCA CGGCCATGAT CGTCAAATCC
GTTGACGAAC TGAAGACCCT GGTGGACGAG TTCTCCAACT TCGCCCGGAT GCCTGCGGCC
CATCCCTCCC CCAACGACCT GAACGCCGTC ATCCGCGAGG CAGTCACCCT CTTCCGGGAA
GGGCACCGCG GCGTCGCGTT CGGCTTCAGC GCCGACGACC GGCTGCCCCT GCTGCAGCTG
GACCGGGACC AGATCAAGCG GGTCTTCATC AACCTCCTGG ACAACGCCGT GGCGGCCATG
GGGGGGACGG GCGAGGTCCG GATCATGAGC CGCTTCGATC CGGAGCTGAA AATGGCGGTG
GTGACCGTGG CCGATACGGG GCCGGGCATC CCCCCGGAGG ACAAGACGCG GGTCTTCGAG
CCCTACTTCT CCACCAAGGC GTCGGGCACC GGCCTGGGGC TCGCCATCGT CAGCAGCATC
ATCACCGACC ATCACGGGTT CATCCGGGTG CGGGACAACG AGCCCCGGGG GACCAAGTTC
GTCATCGAAC TGCCCGTGAC GGGATAG
 
Protein sequence
MDKRTTDSTP TATPAERERI KRRREGIVIA LSVLLILILT RVEIHLSRIS SDVPMGSNFL 
IFGVINVIIL LIILLIYLLF RNVAKLLMER RGKALGANLR TKLVIAFVSL SIIPTMLLFF
VSATYVNQSI RNWFNTQIEN SLSESLEVAQ TYYKNSAANA LYYGNQISAI IRDQRLLNDE
NLPRMKELIR QKQKEYNLGI VEVYSAQNEE LVRAANPGLP LGEFTNPSSE DIKQGLNGKE
LTRVNTVGKA DLIRGIVPIY STYRADEVVG AVVVNYFVPY SLVEKMREIT ASYEQFRQLK
ILKNPIRSGY ILALFLITMV IIFLAVWFGL HLANSLTTPI QELAEATRQV AEGNLNIRLG
QRTDDELGML VAAFNKMTDD LRSNQLALKN ANDELSRSNQ ELEQRRRHME IVLRNVAAGV
IAVDRGGTIT TINPSAARLL QIDMPRAVGH NFREVLKGEQ LDIVKGVLRD MVMAKKDSIS
RQITVPVRDS RATFLFNLSV LRDESGEFLG TVVVFDDMTQ LIKAQRMAAW REVARRIAHE
IKNPLTPIQL SAQRLRKRYL PRFGDEDRVF DECTAMIVKS VDELKTLVDE FSNFARMPAA
HPSPNDLNAV IREAVTLFRE GHRGVAFGFS ADDRLPLLQL DRDQIKRVFI NLLDNAVAAM
GGTGEVRIMS RFDPELKMAV VTVADTGPGI PPEDKTRVFE PYFSTKASGT GLGLAIVSSI
ITDHHGFIRV RDNEPRGTKF VIELPVTG