Gene BURPS1106A_3539 details


Gene Information

Locus tag: BURPS1106A_3539
Symbol: argJ
ID: 4902468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3440486
End bp: 3441772
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 68%
IMG OID: 640136765
Product: bifunctional ornithine acetyltransferase/N-acetylglutamate synthase protein
Protein accession: YP_001067775
Protein GI: 126455430
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase)
TIGRFAM ID: [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase
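
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3440486, 3441772)
print(length)                  # 1287
print(protein_length(length))  # 428
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.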


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGGCTC GAACGCCGGC ATTTTCCTTC GGGCAGGTGC TTCAGATGGC TGTCAATTTT 
CCCTCGATCG ATCCCGCGCA ATTGCATCCC GTCGCCGGCG TGACGCTCGG CTGGGCGGAG
GCGAACATTC GCAAGCCGAA CCGCAAGGAC GTGCTCGTCG TCTCCGTCGA AGAGGGGGCC
ACCGTGTCGG GCGTGTTCAC CGAGAACCGC TTCTGCGCGG CGCCCGTGAC GGTCTGCCGC
GAGCATCTGG CGAAGGTGCG CGCGGGCGGC GCCGGCATTC GCGCGCTCGT CGTCAACACG
GGCAACGCGA ACGCGGGCAC GGGCGAACCG GGCCTCGCGC ATGCGCGCGA GACGTGCGCG
GAACTCGCGC GCCTCGCGGG CATCGCGCCC GGGCAGGTCC TGCCGTTCTC GACGGGCGTG
ATCCTCGAGC CGCTGCCGAT CGAGCGCCTG AAGGCCGGCC TGCCCGCCGC GCTCGCCAAT
CGCGCGGCCG CGAACTGGCA CGACGCCGCG CAGGCGATCA TGACGACCGA CACGCTGCCG
AAGGCGGCAT CGCGCCAGGT GACGATCGAC GGCCACACGA TCACGCTGAC GGGCATCAGC
AAGGGCGCCG GGATGATCAA GCCGAACATG GCGACGATGC TCGGTTTCCT CGCGTTCGAC
GCGAAGGTCG CGCAGCCCGT GCTCGACGCG CTCGTGAAGG ACGTGGCCGA CCGTTCGTTC
AACTGCATCA CGATCGACGG CGATACGTCG ACGAACGACT CGTTCATCCT GATCGCGTCC
GGCAAGGCGA GCCTGCCGCA GATCGCGTCG ACCGATTCGC CGGCGTACGC GGCGCTGCGC
GAGGCGGTGA CCTCCGTCGC GCAAGCGCTC GCGCAATTGA TCGTGCGCGA CGGCGAGGGC
GCGACGAAAT TCATCACGGT GACGGTCGAA GGCGGCAAAA GCGCGGCCGA GTGCCGCCAG
ATCGCCTATG CGATCGGCCA TTCGCCGCTC GTGAAGACGG CGTTCTACGC GTCGGACCCG
AACCTCGGCC GGATTCTCGC GGCGATCGGC TATGCGGGCG TCGCGGATCT CGACGTCGGC
AAGATCGACC TGTATCTCGA CGACGTGCTC GTCGCGAAGG CGGGCGGCCG CAATCCCGCG
TATCTCGAAG AGGACGGCCA GCGCGTGATG AAGCAAAGCG AGATCGCCGT GCGCGTGCTG
CTCGGCCGCG GCGACGCGCA AGCGACGATC TGGACTTGCG ATTTGTCGCA TGATTACGTG
AGCATCAACG CCGACTATCG TTCTTAA
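
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before it can be used for calculations such as GC content. The sketch below shows one way to do this. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed row is used as sample input, so the printed percentage is not expected to match the whole-gene GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence display."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence, used here as sample input only.
first_row = "ATGCCGGCTC GAACGCCGGC ATTTTCCTTC GGGCAGGTGC TTCAGATGGC TGTCAATTTT"
seq = clean_sequence(first_row)
print(len(seq))                            # 60
print(f"{gc_fraction(seq) * 100:.0f}% GC")
```

To check the GC content field, pass the full 1287 bp sequence through the same two functions.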
 
Protein sequence
MPARTPAFSF GQVLQMAVNF PSIDPAQLHP VAGVTLGWAE ANIRKPNRKD VLVVSVEEGA 
TVSGVFTENR FCAAPVTVCR EHLAKVRAGG AGIRALVVNT GNANAGTGEP GLAHARETCA
ELARLAGIAP GQVLPFSTGV ILEPLPIERL KAGLPAALAN RAAANWHDAA QAIMTTDTLP
KAASRQVTID GHTITLTGIS KGAGMIKPNM ATMLGFLAFD AKVAQPVLDA LVKDVADRSF
NCITIDGDTS TNDSFILIAS GKASLPQIAS TDSPAYAALR EAVTSVAQAL AQLIVRDGEG
ATKFITVTVE GGKSAAECRQ IAYAIGHSPL VKTAFYASDP NLGRILAAIG YAGVADLDVG
KIDLYLDDVL VAKAGGRNPA YLEEDGQRVM KQSEIAVRVL LGRGDAQATI WTCDLSHDYV
SINADYRS
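
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ in which start codons are permitted). The `translate` function is a hypothetical helper, and it does not handle alternative start codons, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last displayed rows of the gene sequence above.
print(translate("ATGCCGGCTC GAACGCCGGC ATTTTCCTTC GGGCAGGTGC TTCAGATGGC TGTCAATTTT"))
# MPARTPAFSFGQVLQMAVNF
print(translate("AGCATCAACG CCGACTATCG TTCTTAA"))
# SINADYRS
```

Both outputs match the corresponding rows of the protein sequence. Each displayed row of 60 bases starts on a codon boundary, so rows can be translated individually as well as joined.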