Gene Achl_0962 details

Gene Information

Locus tag: Achl_0962
Symbol:
ID: 7292404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 1054184
End bp: 1055374
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 71%
IMG OID: 643589368
Product: protein of unknown function DUF125 transmembrane
Protein accession: YP_002487046
Protein GI: 220911737
COG category: [S] Function unknown
COG ID: [COG1814] Uncharacterized membrane protein
TIGRFAM ID:
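
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with exactly one stop codon that is not counted in the protein length; `cds_lengths` is a hypothetical helper name used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(1054184, 1055374))  # → (1191, 396)
```

Under these assumptions the result agrees with the listed gene length of 1191 bp and protein length of 396 aa.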


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCAGC ACGCCCAGTC CACGCCGAAC CATTCAGCGC CGCACCATTC CAGCCCGCAC 
CCATCCAGCC CGGAGCCCGC CCCGCGAACC GGCCAGACGG GCCATCATCC CGCACCGTCA
TCCGCGGACC TCAAGCGGTG GCGCCAGTAC CTTGCTGACG AACGGGCCGA AGCCGCCGTT
TACCGCGACC TGGCCCAAAA CCGTGAGGGC GAGGAGCGCG AAATCCTCCT GGCCCTTGCC
GAAGCCGAAG GCCGCCATGA GGCCCACTGG CTGGGATTGC TCGGCGACCA CGCCGGGAAA
CCGCGCCCCG CCTCGGCCCG CAGCCGGATG CTCGGATTCC TGGCCCGCCA CTTCGGATCC
GTCTTCGTGC TCGCGCTGGC GCAGCGCGCC GAAGGACGGT CGCCCTATGC CAAGGATCCC
AACGCCACGG ACGCCATGGC CGCGGATGAA CAGATCCATG AGGAAGTGGT CCGCGGGCTG
GCAACACGGG GCCGCAACCG CCTGGCCGGC ACTTTCCGCG CCGCGGTGTT CGGCGCCAAC
GACGGCCTGG TCAGCAACCT GTCACTCGTA ATGGGCATGG CGGCCTCCGG CGTCGCCAGC
AGCGTGGTGC TGCTCAGCGG CATCGCGGGC CTCCTGGCCG GCGCCATGTC CATGGGCGCC
GGTGAGTTCA TCTCCGTCAG GTCCCAGCGT GAACTGCTGG CCGCCACCCG GCCCACGCAG
GTCACGCTGG CCGCGGCACC CAAGCTGGAC CTGGAGCACA ACGAGCTCCT GCTGGTGTAC
CTGGCCCGGG GCATGTCCCA CGAAGCAGCC GAACACCGGG TGGCCGAACG CACGGGCCTG
CTCTCCTGCG ACTGCGACCC CAGCCTGTCC CTCCAGCCCG AGCTGCCGGA GGAGGAGGAC
CAGCACGAGG CCGTGGGCAC CGCCTGGGGA GCCGCATTGT CCAGCTTCTG CTTCTTCGCC
TCCGGCGCCA TCATCCCCAT CCTGCCGTTC CTGTTCGGCC TGACCGGGGT CTCAGCCCTG
GTGGTTGCCG GCGCCCTGGT GGGCGTCGCA TTGCTGGCAA CCGGCGGCAT CGTCGGCCTG
CTGTCCGGCA CGTCACCCCT CACCCGGGGA CTGCGCCAGC TGGGCATCGG CCTGGGCGCG
GCCGCCGTCA CTTATCTGCT GGGGCTCGTC TTCGGCACCG TCGTCGGCTA G
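
The GC content field in the gene information is presumably the percentage of G and C bases in the gene sequence above; the record does not say how it was computed, so that is an assumption. A minimal sketch of such a calculation, with `gc_percent` as a hypothetical helper name, could look like this:

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored, so the sequence can be pasted in the
    10-base blocks used on this page.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence here; only the first row is shown.
first_row = "GTGTCGCAGC ACGCCCAGTC CACGCCGAAC CATTCAGCGC CGCACCATTC CAGCCCGCAC"
print(round(gc_percent(first_row), 1))
```

Running it on the complete sequence, rather than on the single row used here, is what would be comparable with the reported value.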
 
Protein sequence
MSQHAQSTPN HSAPHHSSPH PSSPEPAPRT GQTGHHPAPS SADLKRWRQY LADERAEAAV 
YRDLAQNREG EEREILLALA EAEGRHEAHW LGLLGDHAGK PRPASARSRM LGFLARHFGS
VFVLALAQRA EGRSPYAKDP NATDAMAADE QIHEEVVRGL ATRGRNRLAG TFRAAVFGAN
DGLVSNLSLV MGMAASGVAS SVVLLSGIAG LLAGAMSMGA GEFISVRSQR ELLAATRPTQ
VTLAAAPKLD LEHNELLLVY LARGMSHEAA EHRVAERTGL LSCDCDPSLS LQPELPEEED
QHEAVGTAWG AALSSFCFFA SGAIIPILPF LFGLTGVSAL VVAGALVGVA LLATGGIVGL
LSGTSPLTRG LRQLGIGLGA AAVTYLLGLV FGTVVG
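
The gene sequence begins with GTG, yet the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG is an accepted alternative start codon and the initiating residue is methionine. The sketch below illustrates this with a hand-built codon table; `translate_cds` is a hypothetical helper, and it assumes the first codon of the input is a valid start codon without checking it.

```python
from itertools import product

# Codon-to-residue mapping in the conventional T, C, A, G base order.
# Tables 1 and 11 share these assignments; they differ in which
# codons are accepted as starts.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    if residues:
        # Assumption: the first codon is a valid start (here GTG), so it
        # is translated as methionine rather than valine.
        residues[0] = "M"
    return "".join(residues)


# First 30 bases of the gene sequence from this record.
print(translate_cds("GTGTCGCAGC ACGCCCAGTC CACGCCGAAC"))  # → MSQHAQSTPN
```

The output matches the first ten residues of the protein sequence listed above.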