Gene GSU2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2049 
SymbolargJ 
ID2686046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2248053 
End bp2249234 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content63% 
IMG OID637126740 
Productbifunctional ornithine acetyltransferase/N-acetylglutamate synthase protein 
Protein accessionNP_953098 
Protein GI39997147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase) 
TIGRFAM ID[TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000663169 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTCA AGGGATTTCG GTTCTCGGCC GTTGAAGCGG CCATTAAGAA GCCGGGCCGT 
CTGGACTTGG CCCTCATCTG CTCGGACGCG CCCGCTGCAG TTGCCGCGGT TTACACCACC
AACAAGGTGA AGGCAGCGCC GGTGCTTCTG GACATGGAGC GAACCACGAG CGGCACCTGC
CGCGCGGTGG TGGTCAACAG CGGCAACGCC AATGCCTGCA CCGGAGACCG GGGGATGGAG
GACGCGCGGG AAACCACCAG CCTCGTGGCC GAACGGATTG GTGCATCTGA GCACGAGGTG
CTCGTATGCT CTACCGGCGT GATCGGCGTG CCGCTCCCCA TGGAGCGGAT CAGGGGAGGG
ATTCCTTCCC TCGTGGCCGG GCTGGGTTCA GCGACCCTCG ATCAGATCGC CGCGGCCATC
ATGACAACCG ACACCTTCCC GAAACTGGAG GCGCGTACCG GGACTGCGGG AGGCGTCGGG
TACACCATCG CCGGTATCGC CAAGGGCGCC GGCATGATCA TGCCGAACAT GGCCACCATG
CTCGCCTTTG TCGTCACCGA TGCCGCAGTG GACCCCCAGT GGCTCGACCG GGTTTTCCGC
CGCGCCAACG ATACCTCTTT CAATGCCATC ACCGTGGACG GCGACATGTC CACCAACGAT
ACCGCCATCA TTATGGCCAA CGGAGCAGCC GGCAACCCGG TTCTGTCCGA GGGGAGCGAG
GGCGCCGCGG AATTTGCTGT TCTTTTGGAG GAGGTGCTCC TCTCTCTGGC CAAGCTGATC
GTCAAGGATG GAGAAGGGGC CACCAAGTTT GTGGAAGTAA CCGTGAAGGG TGCCCGCTCC
GATGCCGACG CCAAGCGGGC CGCCATGGCC GTCGCCAATT CATGCCTGGT GAAGACCGCC
TTTTTCGGGC AGGATGCCAA CTGGGGGCGG ATTTTCGCGG CGGTGGGCTA CTCCGGCGCG
GACGTGGAAC CGGACCGTGC CGAGCTGTTT TTCGACGATG TCAGGATGGT ACAGGGTGGT
GTTTTCGCAG GCGGCGACGC TGAGGCGCGG GGTACCGGGG TATTGCGGAA GAAGGAGTTC
ACCGTTACTG TAGACCTGCA TCTGGGCGAC GGACGGGCAA CGGTTTACAC CTCGGACCTG
TCCTACGACT ACGTCAAGAT CAACGCCGAT TACCGTACCT GA
 
Protein sequence
MNVKGFRFSA VEAAIKKPGR LDLALICSDA PAAVAAVYTT NKVKAAPVLL DMERTTSGTC 
RAVVVNSGNA NACTGDRGME DARETTSLVA ERIGASEHEV LVCSTGVIGV PLPMERIRGG
IPSLVAGLGS ATLDQIAAAI MTTDTFPKLE ARTGTAGGVG YTIAGIAKGA GMIMPNMATM
LAFVVTDAAV DPQWLDRVFR RANDTSFNAI TVDGDMSTND TAIIMANGAA GNPVLSEGSE
GAAEFAVLLE EVLLSLAKLI VKDGEGATKF VEVTVKGARS DADAKRAAMA VANSCLVKTA
FFGQDANWGR IFAAVGYSGA DVEPDRAELF FDDVRMVQGG VFAGGDAEAR GTGVLRKKEF
TVTVDLHLGD GRATVYTSDL SYDYVKINAD YRT