Gene Achl_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1967 
Symbol 
ID7293428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp2217701 
End bp2219032 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content64% 
IMG OID643590371 
ProductCBS domain containing protein 
Protein accessionYP_002488030 
Protein GI220912721 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00000080224 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACGCCCC TGCTCCTGGT GGCGATGGCC CTGGTTTTCC TGGCCATCGC CGCCGTTCTG 
ACTGCGGCCG AGGCCGCCTT CAATTTCCTC CCGCGCCACG ATGCGGAAGA AGCTCTGCTC
CGGAGCCGCG GAACGGCTAT GCGCAGCATC CTCCGCGAGC CCGTTGCCCA CATCCGGGCC
CTGAGGTTCT GGCGTGTCTG GTTTGAAATG GCCTCCGCCG TGGCCGTCTC GGTGCTGCTT
TACAGTCTCC TGGACAACAT CTGGCTGGCC GGCCTTGCAG CAACCGGCAT CATGGCGCTG
CTGGGGTTCG TGATCGTGGG TGTCTCCCCC CGCCAGCTGG GCAGACTCCA CTCGTCCGCG
GTAGTGCGCT TCACTGCGCC GATGATCCGT TTCCTGACGT GGGTACTGGG GCCCATCCCC
GGATGGCTCG TCGCCGTGGG TAGCGCTGCC GCGCCAGGCG CCCCCGGCGG GGATGAAGCC
TTCTTCAGCG AACAGGAATT CCGTGAACTG GTGGACCGCG CCAGCGAATC CGACATGATC
GAGGACACCG AAGCTGAGAT GATCCAGTCG GTGTTCGACT TCGGGGACAC GCTGGTCCGT
GCCGTCATGG TGCCCAGGAC GGACATCGTT GGCATCGACT CCGGTTCGAG CCTGCACGAG
GCCATGTCCC TGTTCCTCCG GTCCGGCTAC TCCCGGGTCC CGGTTATCGG CGAGGACACC
GACCACATCC TGGGAATCGT CTACCTCAAG GACGTGGCCG CCGTCGTGCA TGAACTGGAC
GCCGGCGTCG AACCGCCCAC GGTCGACTCC ATGGCACGCG AGGTCCGCTA CGTGCCGGAA
TCAAAGCCGG TGAGCGACCT GCTGCGGGAA CTGCAGAAGG AGTCCACACA TGTGGCCATC
GTCATCGACG AGTACGGCGG CACGGCCGGC CTGGTCACGC TTGAGGACCT GATCGAGGAA
ATCGTGGGCG AGATCGTTGA CGAATACGAC ACCGAAAGCG CCGAGGCCGT AGAGCTCGGC
GACGGTTCCT ACCGGGTCAG CTCCAGGATG AGCATCGATG ACCTCGGAGA GCTTTTTGAC
ATCGAGCTTG ACGACGACGA AGTGGACACC GTGGGCGGAC TGCTCGCCAA AGCGCTGGGA
CGTGTTCCCA TCGTAGGCAG CAGGGTGGAA GTCCAGGGAG TATCGCTGCG CGCCGACCGG
CTCGAGGGCC GCCGCAACCG CGTCAGCCAT ATCATTGCGG CACCCGTGCC AACAGTAGAC
ACTGACCTTG AAGACCTCTT CCATGAGGCG GAAGCAACCC AGCAGGGAGT TCCACGTGAG
CAAGAAAAGT AA
 
Protein sequence
MTPLLLVAMA LVFLAIAAVL TAAEAAFNFL PRHDAEEALL RSRGTAMRSI LREPVAHIRA 
LRFWRVWFEM ASAVAVSVLL YSLLDNIWLA GLAATGIMAL LGFVIVGVSP RQLGRLHSSA
VVRFTAPMIR FLTWVLGPIP GWLVAVGSAA APGAPGGDEA FFSEQEFREL VDRASESDMI
EDTEAEMIQS VFDFGDTLVR AVMVPRTDIV GIDSGSSLHE AMSLFLRSGY SRVPVIGEDT
DHILGIVYLK DVAAVVHELD AGVEPPTVDS MAREVRYVPE SKPVSDLLRE LQKESTHVAI
VIDEYGGTAG LVTLEDLIEE IVGEIVDEYD TESAEAVELG DGSYRVSSRM SIDDLGELFD
IELDDDEVDT VGGLLAKALG RVPIVGSRVE VQGVSLRADR LEGRRNRVSH IIAAPVPTVD
TDLEDLFHEA EATQQGVPRE QEK