Gene GSU1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU1003 
SymbolntrC 
ID2687478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp1082803 
End bp1084248 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content59% 
IMG OID637125673 
Productnitrogen regulation protein NR(I) 
Protein accessionNP_952057 
Protein GI39996106 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR01818] nitrogen regulation protein NR(I) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACTGA ACCGCATATT GGTTGCCGAT GACGAAGAAA GCATGCGCTG GGTCCTCTCG 
AAGGCCCTGC GCAAAAAGGG ATTCACCGTG GACCTCGCCC GCGACGGAGA GGAAGCCCTG
CGACTGATCC AGTCCAACGA GTACGACTTG GCCATCCTCG ACATCAAGAT GCCCGGCTTC
ACCGGGCTGG AATTGCTCGA CAAGGTACGG GAGCTCAAGC ACGATCTCCT CATGGTGATC
ATGACCGCCG AGGCGAGCAT GAAAAACGCC GTGGAGGCCA TGAAGCGGGG GGCCTACGAT
TACATCACCA AGCCCTTCGA CCTGGATGTA ATCGATGCCA TCATCGAAAA GGTGCACAAG
GCCCGGGAGA TCACCTCCCA AATGACTATT CTGCGGGAAG AGCTGAAAGA GCGCTATCAC
CTGGAGAAGA ACATCATCGG CAACTCCCCC GCCATGCGGG AAGTCTACAA GACCATCGGC
AAGGTTGCCC CCAGCGACGT GACCGTTCTC GTTCAGGGGG AGTCAGGGAC CGGCAAAGAG
CTCATCGCCC GGGCTATTCA CTTCAACTCC AAGCGGATCG GCAAGCCGTT CATCGCCCTC
AACTGCGCCG CCATTCCCAA AGATTTGCTG GAAAGCGAAC TCTTCGGCTT CGAAAAGGGG
GCGTTCACCG GCGCCGTCGA GCGCAAGCTG GGCAAGTTTG AGCAGGCCAA CGGCGGCACC
ATCTTCCTTG ACGAGATCGG CGACATGCCC CTCGATCTCC AGGCAAAAAT CCTGCGGGTG
CTCCAGGAGA AGGAAGTTAC CCGCACCGGC GGCAGCCAGA ACATCGCCGT GGACGTACGG
ATCGTGGCAG CCACCAACCA GAACCTGGAG GAACTGGTCC GCAAGAAGCA GTTCCGGGAG
GATCTCTTCT ACCGGCTCAA CGTGGTGCCT ATTCAGCTGG TACCGCTGAG GGAGCGTAAG
GAAGACGTGC CGCTTCTGGT GGACTATTTC CTCCAAAACG CCTGCGCGGA ACTGGAGGTT
TCGCCAAAAA AATGCTCTCC CGAGGCCATG GCGCTCCTCA CCACCCACAG CTGGCCGGGC
AACGTACGGG AACTGGAGAA TACCATCAAG CGGGCGGTGA TCCTCTCGTC CGACCCGCTT
CTCACCCCAT CCGACTTTCC GGGGCTGCGT GCCCGCCAGA CGGGAAGCGA GGCGACCGCT
GCGGACGACC TCTCCCTGGA AGCCCTGGTG GACATGAAAC TGCGGGCAAG CCTCACCAAC
CTGGACAAAA TGGAGAGCGG GGATATCTAT AACCTGGTCC TCAAGCAGAT CGAGCGGCCT
CTCATCCGCT TCGTCCTGGA AAAGACGCGT GGCAACCAGG TGAAAGGAGC TGAGATCCTC
GGCATTAACC GCAACACGCT ACGCAAGAAG ATTCAGGAGC TGGGCATCGA ACTGAGAAAA
GACTGA
 
Protein sequence
MLLNRILVAD DEESMRWVLS KALRKKGFTV DLARDGEEAL RLIQSNEYDL AILDIKMPGF 
TGLELLDKVR ELKHDLLMVI MTAEASMKNA VEAMKRGAYD YITKPFDLDV IDAIIEKVHK
AREITSQMTI LREELKERYH LEKNIIGNSP AMREVYKTIG KVAPSDVTVL VQGESGTGKE
LIARAIHFNS KRIGKPFIAL NCAAIPKDLL ESELFGFEKG AFTGAVERKL GKFEQANGGT
IFLDEIGDMP LDLQAKILRV LQEKEVTRTG GSQNIAVDVR IVAATNQNLE ELVRKKQFRE
DLFYRLNVVP IQLVPLRERK EDVPLLVDYF LQNACAELEV SPKKCSPEAM ALLTTHSWPG
NVRELENTIK RAVILSSDPL LTPSDFPGLR ARQTGSEATA ADDLSLEALV DMKLRASLTN
LDKMESGDIY NLVLKQIERP LIRFVLEKTR GNQVKGAEIL GINRNTLRKK IQELGIELRK
D