Gene Achl_1738 details


Gene Information

Locus tag: Achl_1738
Symbol:
ID: 7293198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 1956062
End bp: 1957357
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 69%
IMG OID: 643590148
Product: type III effector Hrp-dependent outers
Protein accession: YP_002487808
Protein GI: 220912499
COG category: [S] Function unknown
COG ID: [COG3395] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
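
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 1956062
end_bp = 1957357
gene_length_bp = 1296
protein_length_aa = 431

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is unspliced and the final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, codon_count, expected_protein_aa)
```

Under these assumptions the span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.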


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.0119414
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGCTT TCGGTTTCGT TGCGGACGAC CTCACCGGAG CCGCCGACGT CCTGGCCCAG 
GCGCACCGGT ACGGCCTCGA AGCAGCCCTC GTCATCGGTG ACGCGCCCCT TCCCACGGAT
GCCGCCGTCG TCGGCTTCGC GGGGCCTGCG CGCTCCCTTG CGGGGACCGC GTTCGACGCC
CTGGTGAGCC GCGACCTCGC CGGCATCGCC GCCCTGAACC TGGACGTGCT GCTCTATAAG
GTCTGTTCCA CCTTCGACAG CTCCCCCACC GTGGGCAGCA TCGGCCGGGG AATCGAACTG
CTGCACGAAC AGTTCCCGCT GCACGGGGCC ATCCCCGTGA TTCCCGCCCA ACCCGGATTT
GGCCGGTACA CAGCGTTCAG CAACCACTAC GCAACGTATG CCGGGCAGTC CTACCGGTTG
GACCGCCACC CAGTCATGTC CCGGCACCCA TCCACCCCGA TGTCCGAAGC AGACCTGCGC
GAGGTCCTGG CCGAACAGCT CACGTCAGGC ACGACGCCGG GTGCGATCCA CCTGCCGGCG
TATGAGGACG GGACGTTCAA GGACGCCTGG GCAGACCGCC GGCACGAACC CGGCGCGCAG
GCCTTCGTGG TTGACGCCGT GGATGAACAC CACATGGATG CCGTGGCTGA AGTCCTGACC
CGCGAAGAGC ACGGCCACGG CCCGTCCATC GTCGTCGGAT CCGGCGGCAT CATGGCCGCG
CTGGCGAGGA CCCTCTCAGA CAGCGTTCCG GCGGCACCCG GCGCTCAGGC CGCCTCCGGA
CCTGCGCTGG CAGTCAGCGC CTCGGCGTCG AGCACCACGG CGGAACAGAT CAGTGATGCC
GTGGCCCACG GGTGGGTGGA AGTACCCGTC CCCGTCGAAT TGCTGGATCG CCACAGCCCG
GCGCTCGTGG CGGCTTTGGA CGAGCGCGTC TCCGCAGCGC TCCGCGCAGG CAACAACGTC
GTCGTCCACA CCACCCGCGG TGCCGGCGAC CCCCGCTATG GCACCGCTAA GCCGGTCGAT
GCAGGCTACG TTGGCGCACT CATCGGCGGC ATCGCCGCCA GGATCGCCCA AGCGGGCCTG
ACCCGCGACA TCGCCGTCTT CGGAGGCGAC ACCTCCAGCC ATGCACTCAT CGCCATGGGA
GTGCGCCAGC TCCGCGTTTC CGGGCAGTTC GTCACCGCCG GGCCGATCCT CAAAGCCGAC
GGCGCCTCCG CAGTTGCCGG ATGCCGCCTC CTTCTCAAGG GTGGCCAGGT TGGCCCCACC
GACATCCTGC GCCGGTTCGC CGGACAACCC CGGTAA
 
Protein sequence
MPAFGFVADD LTGAADVLAQ AHRYGLEAAL VIGDAPLPTD AAVVGFAGPA RSLAGTAFDA 
LVSRDLAGIA ALNLDVLLYK VCSTFDSSPT VGSIGRGIEL LHEQFPLHGA IPVIPAQPGF
GRYTAFSNHY ATYAGQSYRL DRHPVMSRHP STPMSEADLR EVLAEQLTSG TTPGAIHLPA
YEDGTFKDAW ADRRHEPGAQ AFVVDAVDEH HMDAVAEVLT REEHGHGPSI VVGSGGIMAA
LARTLSDSVP AAPGAQAASG PALAVSASAS STTAEQISDA VAHGWVEVPV PVELLDRHSP
ALVAALDERV SAALRAGNNV VVHTTRGAGD PRYGTAKPVD AGYVGALIGG IAARIAQAGL
TRDIAVFGGD TSSHALIAMG VRQLRVSGQF VTAGPILKAD GASAVAGCRL LLKGGQVGPT
DILRRFAGQP R
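
The gene and protein sequences above are printed in blocks of ten characters. As a rough consistency check, the sketch below strips that spacing and translates the first and last lines of the gene sequence. It is a minimal illustration, not the database's own tooling: the helper names `clean` and `translate` are hypothetical, and it assumes the elongation codon assignments of translation table 11 match the standard genetic code, which is sufficient here because the gene starts with ATG.

```python
# Codon table built in the conventional T, C, A, G order.
# Assumption: table 11 shares the standard elongation codon assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the listing.
# The 21 preceding lines hold 60 bases each, so the last line starts in frame.
first_line = "ATGCCGGCTT TCGGTTTCGT TGCGGACGAC CTCACCGGAG CCGCCGACGT CCTGGCCCAG"
last_line = "GACATCCTGC GCCGGTTCGC CGGACAACCC CGGTAA"

print(translate(first_line))  # MPAFGFVADDLTGAADVLAQ
print(translate(last_line))   # DILRRFAGQPR
```

Both results agree with the start and end of the listed protein sequence once its spacing is removed.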