Gene Hlac_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2199 
Symbol 
ID7401134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2182526 
End bp2183866 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID643709271 
Productaminotransferase class-III 
Protein accessionYP_002566846 
Protein GI222480609 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGCG ATACCGCCGC GCCCGACGTG ACTGACCTCC CGGGCGACCG CGCACGAGAG 
TGGGTGGAGT ACCACCACGA GTCGGCCGCG CCGAGCACGT ACGTCTACGA GTTCGTCTGG
GACCGCACCG CGCCCGCCGA AGGACCGTTC TGCACCGACG TCGACGGCAA CGTCCTCATG
GACTTCACGA GCCACGTCGC CGCCGCGCCG CTGGGGTACA ACAACCCGAA GATTATGGAG
CCGCTCGCGG AGTTCGACCT CGTCGACCCG CTGAAGATCG CGGGCCAGGA CTTCTACGTC
GCCGGCGGCG AGTCGCCCGG CGACGGGCTT CCGGGTTCGT CCGGGTTGAT GGAACGGCTC
ACTGAGATCA CCGCCCACTA CGACATGGAC ACCGTCTTCC TCTCGAACTC GGGGGCAGAG
GCGGTCGAGA ACGCGATCAA GATCGCGTAC GACGACTCCG GCGGCGCCAA ACACGCGATC
ACGTTCGACG GCGCGTTCCA CGGGCGGACG CTCGGCGCGC TCTCGCTCAA CCGCTCGAAA
TCCGTGTATC GCCGCGATTT CCCGGAGATC AGCGGGATTC ACGACGCACC CTTCTGCGAC
GACCGGAACT GCACCGCCGA GACCTGCTCG TGCGGCTTCT TCGTCGACGG CGCGTCGCAA
CTCCGACGCA AGCTCGACCC CGAGCGCGGT CACATCGACC CCGACGACGT AGCGTACCTC
ATCTTAGAGC CGATCCAAGG GGAAGGGGGA TACCGGTTCC CCTCCGACGC GTTCACCGAC
GAGATCGCCG CCTTGGTCGA CGAACACGAC ATCACGCTGA TCGCCGACGA GATCCAGTCG
GGCGTCGGTC GCACCGGCGA GATGTGGGGC TCGGACCACT ACGCGCTCGA ACCCGACGTG
ATCACCAGCG CGAAGGGACT CCGTGTCGGC GCCACGATCT CCCGCTCGGA CGTGTTTCCC
GAGGAAAAGA GCCGGCTCTC CTCGACGTGG GGGGCGGGCG ACATCATCGC TTCCGCGCAG
GGCGCGCTCA CGCTCGACGC GATCCGTGAG CACGACCTGA TGGACAACGC CACGGTTCGA
GGGCGACAGT TCAAAGAGAC GATGCGCGAC GCCGACCTCC CGGGCGTCGA CGACGTGCGC
GGGAAGGGGC TGCTGCTCGC GGTCGAGTTC GACTCGAAGG AGCGCCGCGA CGCGGTCCAG
AAAGGCGCGT TCTCCCGGGG CCTGCTCACG CTGGCGTGCG GCCACGACGT ACTCCGCGTC
CTCCCGCCGC TCGACGTCAC CGAACGCGAG ATCGAGCTCG GCTGCGACCT CCTCACGAGC
GCGATCGCCG ACGCGGCGTA G
 
Protein sequence
MDRDTAAPDV TDLPGDRARE WVEYHHESAA PSTYVYEFVW DRTAPAEGPF CTDVDGNVLM 
DFTSHVAAAP LGYNNPKIME PLAEFDLVDP LKIAGQDFYV AGGESPGDGL PGSSGLMERL
TEITAHYDMD TVFLSNSGAE AVENAIKIAY DDSGGAKHAI TFDGAFHGRT LGALSLNRSK
SVYRRDFPEI SGIHDAPFCD DRNCTAETCS CGFFVDGASQ LRRKLDPERG HIDPDDVAYL
ILEPIQGEGG YRFPSDAFTD EIAALVDEHD ITLIADEIQS GVGRTGEMWG SDHYALEPDV
ITSAKGLRVG ATISRSDVFP EEKSRLSSTW GAGDIIASAQ GALTLDAIRE HDLMDNATVR
GRQFKETMRD ADLPGVDDVR GKGLLLAVEF DSKERRDAVQ KGAFSRGLLT LACGHDVLRV
LPPLDVTERE IELGCDLLTS AIADAA