Gene Achl_3020 details


Gene Information

Locus tag: Achl_3020
Symbol:
ID: 7294500
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 3361679
End bp: 3362908
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 70%
IMG OID: 643591430
Product: glutamate--cysteine ligase GCS2
Protein accession: YP_002489070
Protein GI: 220913761
COG category: [S] Function unknown
COG ID: [COG2170] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02050] uncharacterized enzyme
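
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
# Consistency check for the coordinate and length fields of the record.
# Assumptions: coordinates are 1-based and inclusive, and the CDS length
# includes the terminating stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length(3361679, 3362908)
    print(length)                  # 1230
    print(protein_length(length))  # 409
```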


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.163677
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGGC AGGAAGCAGC AGGGCAGGAC CCGCGGCGGC AAGGCAGCAC GACGGCGGCG 
GGCGGCGTCC GGAATGACGG CGCCGGGCGT CCCGGGCCCA GGACGTTCGG GGTTGAGGAG
GAACTGTTGC TGGTGGACCC CGGCCGGGGC GACGCGGTAC CCATGGCCGG CGCCCTGCTG
GACCTTTACG TCCGCCCGCT GGAATCCAGT GCCGGGCCGG TGCTCACCGC CGAGTTCCAG
CAGGAAATGA TCGAAGTGGT CACCCCGCCG CACTCCACGC TTGCCGGGCT CCAGGCGGAC
ATCGTTGCGG GGCGGGACAT CGCCCGGCAG GCCGCGGAGG ACGTGGGCGT CCGGGTGGCT
GCTCTGGGCA CTTCCCCCCT GCCGAGCGAC CCGCACCCGG TGCGGCTGCG CCGGTTCGCG
GCCATGGTGG AAGAGTATGG ACTCACTGCC CGGGAACAGC TGACCTGTGG CACCCACGTC
CACGTTTCGG TGGAGTCTGA CGAAGAAGCG GTGGGAGTGC TGGACCGGAT CCGGAACTGG
CTGCCGGTGC TGGTGGCGCT CAGTGCCAAC TCCCCGTTCT GGCATGGAGA GGACACTGGG
TACGCGAGCT ACCGGTCCCA GGTGTGGAGC CGGTGGCCGT CTGCCGGGCC ACTGGACATC
CTGGGCACCC CGGATGCCTA CCACCAGCTG GTGCACGACA TGGTGAGCAC CGGCGTCGCC
ATGGATGAAG GCATGATCTA CTTCGACGCC CGGCTGTCCC GGCACTATCC CACCGTGGAG
GTGCGGATCG CCGACGCCTG CATGATGCCG GAAAACACCG TGCTGCTGGC CGGGATCGTC
CGCGGACTGG TGGAAACCGC GGCCCGCGAA TGGAAGGCCG GAACCGGGCC GGCGCCGGTG
CCCACCGCCC TGCTGCGGCT GGCTGGATGG AAGGCCAGCC GCTGGGGGCT GCGCGGGGAA
CTCCTGGATC CGGTGACCAG CAGGCCCGGG CCGGCCCTCG GCGTCGTCAA TTCCCTCCTG
CACCATATCC ACGGCGCGCT GGAGGACATG GGCGACCTGG AGCGGGTGGA GGAACTCACG
GACCAGCTCC TGCACACCGG CACCGGAGCC GTCCGCCAGC TCGAGGTGCT GCACCGGACG
GGCGACCTGG AGGACGTGGT GGATGACGCC GCCAACTGCA CCGTGGGGTC CGAAATCCAA
GGTGCGCGGC GGGGAATGCC GGGGGATTGA
 
Protein sequence
MDGQEAAGQD PRRQGSTTAA GGVRNDGAGR PGPRTFGVEE ELLLVDPGRG DAVPMAGALL 
DLYVRPLESS AGPVLTAEFQ QEMIEVVTPP HSTLAGLQAD IVAGRDIARQ AAEDVGVRVA
ALGTSPLPSD PHPVRLRRFA AMVEEYGLTA REQLTCGTHV HVSVESDEEA VGVLDRIRNW
LPVLVALSAN SPFWHGEDTG YASYRSQVWS RWPSAGPLDI LGTPDAYHQL VHDMVSTGVA
MDEGMIYFDA RLSRHYPTVE VRIADACMMP ENTVLLAGIV RGLVETAARE WKAGTGPAPV
PTALLRLAGW KASRWGLRGE LLDPVTSRPG PALGVVNSLL HHIHGALEDM GDLERVEELT
DQLLHTGTGA VRQLEVLHRT GDLEDVVDDA ANCTVGSEIQ GARRGMPGD
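
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons), strips the spacing used in the listing, and stops at the first stop codon. The helper names `translate` and `gc_fraction` are hypothetical.

```python
# Minimal translation sketch for sequences formatted in blocks of 10 bases.
# Assumption: the codon-to-amino-acid mapping of translation table 11 is the
# same as the standard code, given here in the conventional TCAG ordering.
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First and last lines of the gene sequence listed above.
    first_line = "ATGGACGGGC AGGAAGCAGC AGGGCAGGAC CCGCGGCGGC AAGGCAGCAC GACGGCGGCG"
    last_line = "GGTGCGCGGC GGGGAATGCC GGGGGATTGA"
    print(translate(first_line))  # MDGQEAAGQDPRRQGSTTAA
    print(translate(last_line))   # GARRGMPGD
    print(round(gc_fraction(first_line), 2))
```

Passing the complete gene sequence to `translate` should yield the full protein sequence, and `gc_fraction` on the complete sequence can be compared with the GC content reported in the record.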