Gene Pars_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1404 
Symbol 
ID5054222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1267135 
End bp1268286 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content61% 
IMG OID640468947 
Productaminotransferase class-III 
Protein accessionYP_001153616 
Protein GI145591614 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4992] Ornithine/acetylornithine aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.443564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.505281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAGAA TTGCCAAGTA CTATAGAGAG TACGGAGTAC GTATTGTGAG GGGTTTTATG 
CAGTACGTCT GGGACGACCA GGGGAGGCGC TATATTGACT GCAACACCAA CCACGGCGTC
GTCTTCCTGG GACACGCCAA CCCCAAAATC GCAGAGGCCG TCAAGAGGCA GTTGGAAGAG
GTCTGGGCTG TGCCCCTCAA CTTTTCTACC CCGGCCAGGG AGAGGTTTAT TGAAGAGTTC
TCGAAGCTTC TGCCCGCCAA GTTCGGGGTC GTCTTCCTAC AGAACACCGG CACGGAAGCC
GTGGAGGTGG CCATCAAGAT CGCCAAGAAG GTCACCAAGA GACCCACAAT AGTGGCCTTC
ACCAACAGCT TCCACGGAAG GACCATGGGC TCCCTCTCAA TCACCTGGAA CGAGAAGTAT
AGGAAGGCCT TCGAGCCCCT CTACCCCCAC GTCCGCTTCG GCAAGTTCAA CGTGGCGGCT
GAGGTGGATA AGGTTATCGG CGAGGACACC TGCTGCGTAG TCGTGGAGCC CATCCAGGGC
GAAGGCGGGG TGAATCCTGC CACGCCGGAG TTCCTCAAGG CGCTCAGAGA GGAGACTCAG
CGCAAAGGCG CCCTGCTCAT AATAGACGAG GTCCAGACCG GCTTCGGCAG AACAGGCGCT
GTGTGGGCCT TCCAGAAATA CGGCGTAGAG CCCGACATAT TCACCGCCGG CAAGCCCGTG
GCAGGGGGCC TCCCCATAGG CCTCGCCGTA GCCAAGGAGG AGTTCGGAGA CGTCTTCGAG
CCGGGGGAGC ACGGATCCAC CTTCGCCGGC AACGCGGTGG TTATGGCCGC TGCCGCCGCC
GCCTCCCGGC TCCTTAGGGA GGAGGACGTC CCCGCCAAGG CCGAACGCGC CGGCGCTGAG
CTCGCCAAGG CCCTCGCCGA GACGGGTTCC CGCCTAGCCG TCAGGGTAAA GGGCATGGGC
CTAATGCTCG GGCTAGAGCT GAGGGTAAAG GCAGACCAGT TCATACAGCC CCTCCTAGAC
CGCGGAGTAA TGGCCCTCAC CGCGGGGGTC AACACCCTGA GGTTCCTCCC ACCCTACATG
ATCTCAAAGG AGGACGTGGA GGTGGTACAC GGAGCCGTCT CCGAGGTTTT AAAAAAGGCA
GAAGCCCCCT AG
 
Protein sequence
MGRIAKYYRE YGVRIVRGFM QYVWDDQGRR YIDCNTNHGV VFLGHANPKI AEAVKRQLEE 
VWAVPLNFST PARERFIEEF SKLLPAKFGV VFLQNTGTEA VEVAIKIAKK VTKRPTIVAF
TNSFHGRTMG SLSITWNEKY RKAFEPLYPH VRFGKFNVAA EVDKVIGEDT CCVVVEPIQG
EGGVNPATPE FLKALREETQ RKGALLIIDE VQTGFGRTGA VWAFQKYGVE PDIFTAGKPV
AGGLPIGLAV AKEEFGDVFE PGEHGSTFAG NAVVMAAAAA ASRLLREEDV PAKAERAGAE
LAKALAETGS RLAVRVKGMG LMLGLELRVK ADQFIQPLLD RGVMALTAGV NTLRFLPPYM
ISKEDVEVVH GAVSEVLKKA EAP