Gene Achl_2998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_2998 
Symbol 
ID7294478 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3343097 
End bp3344272 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content64% 
IMG OID643591408 
Productflagellin domain protein 
Protein accessionYP_002489048 
Protein GI220913739 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00055355 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGAATGC AGATCAACAC CAACCTGGCT GCGAACAACG CTTACCGCAG CCTCAGCAAC 
ACCCAGAACG ACCTGTCCAA GTCACTGGAG AAGCTCTCCA GCGGCCTGCG CATCAACCGC
GCCGGTGACG ACGCGGCCGG CCTGGCAATC TCGGAAAGCC TGAAATCCCA GATCGGCGGC
CTGAACGTTG CTTCCCGCAA CGCCCAGGAC GGCATCGGGC TGGTGCAGAC AGCGGAAGGC
GGCCTCAGCC AGGCACATTC CATCCTGCAG CGCCTGCGCG ACCTGGGCGT CCAGGCCGCG
AACGACACGA ACAACACTGA CTCGCGGGCA GCCATCAAGA CCGAAGCCAC CAGCCTGGTC
GAGGAACTGG GCCGCATCGC CGGCTCCACT GACTTCAACG GCACCAAGCT GCTGAACGGT
GACAACGCTT CCCTGAAGAT CCAGGTTGGT GCCAACGGTG ATGCTGCCAG CCAGATCGGC
GTGGACCTTT CCGGTGCCAA CGTCAAGGCG ATTGCCAACA CGCTGAACCT CGGTGCGCTG
GCCAGTGGCG GCAGCAAGTT CGACATCGCC GACGCCACAG CGCTTGCCGG CGCAGCGACC
TTCAGCTCCA CCAAGGATGG CGTGGTGACC ACGGTTACCA CCGCAGACCT CGGTGCAGCC
GGTTCCTTCA CCAGTGTCGA AGGCTACGCC GACGCACTGC GCAAGGATGC CGACTTCTCC
TCGAAGTTCA CGGTGTCAGT GGAGAAGGAT GCCAATGGCG CGGGCACGGG AATCGTGGTC
CAAGCCAAAG ACGGCGGCGA CCTGCTGGAC GCTGACAACG CAACCGCGGG TACGGGCCTC
GCTGCCGGTG CCGCGACCGC GTCGGTGGCA ACCGGACTGG ACTTCTCCAA CGCGTCCAAG
GCCCAGGCAT CAATCACCCT GATCGACACC CAGATCAAGA ACGTCTCCAC TGCCCGTGCA
GACCTGGGCG CAACCCAGAA CCGCCTGGAA TCCGCTGTGC AGACCATCAA CGTGGCCAAG
GAAAACCTGA CCGCATCCAA CAGCCGGATC CGCGACACGG ACATGGCCGA GGAAATGGTC
AAGTTCACCC GCAACAACAT CCTGTCCCAG GCCGGAACCG CAATGCTCGC GCAGGCCAAC
CAGTCCAGCC AGGGTGTCCT GCAGCTGCTG CGCTAG
 
Protein sequence
MGMQINTNLA ANNAYRSLSN TQNDLSKSLE KLSSGLRINR AGDDAAGLAI SESLKSQIGG 
LNVASRNAQD GIGLVQTAEG GLSQAHSILQ RLRDLGVQAA NDTNNTDSRA AIKTEATSLV
EELGRIAGST DFNGTKLLNG DNASLKIQVG ANGDAASQIG VDLSGANVKA IANTLNLGAL
ASGGSKFDIA DATALAGAAT FSSTKDGVVT TVTTADLGAA GSFTSVEGYA DALRKDADFS
SKFTVSVEKD ANGAGTGIVV QAKDGGDLLD ADNATAGTGL AAGAATASVA TGLDFSNASK
AQASITLIDT QIKNVSTARA DLGATQNRLE SAVQTINVAK ENLTASNSRI RDTDMAEEMV
KFTRNNILSQ AGTAMLAQAN QSSQGVLQLL R