Gene Achl_1840 details

Gene Information

Locus tag: Achl_1840
Symbol:
ID: 7293300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2084447
End bp: 2086756
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 68%
IMG OID: 643590245
Product: sulphate transporter
Protein accession: YP_002487905
Protein GI: 220912596
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0288] Carbonic anhydrase; [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
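
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length counts the terminal stop codon; the function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length_bp(2084447, 2086756)
print(length, protein_length_aa(length))  # prints: 2310 769
```

Under these assumptions the result agrees with the Gene Length (2310 bp) and Protein Length (769 aa) fields.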


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.000017022
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGCCCG GCCCTGCCAC AACAACAGCC TCCAACCACA GCCCGCCCGG AGGGCGCCGC 
GAACGGGCCA ACCGCATCCG GCCCTTCCTG TCGAACCTGG GCGCGGATGT ACCCGCCTCC
CTGGTGGTGT TCCTCGTGGC GCTGCCGTTG TCCCTCGGGA TCGCCGCTGC CTCCGGTGCG
CCCATCATGG CCGGACTCAT CGCCGCAGCC ATCGGCGGCA TCGTTGCCGG CAGCCTGGGC
GGCGCTCCGC TGCAGGTCAG CGGGCCGGCC GCGGGCCTGA CCGTGATTGT TGCCGGCCTG
GTCCAGGAAT TCGGCTGGCA GGCCACCTGT GCCATCACGG CCGCAGCCGG CGTTGTGCAA
CTTCTGCTCG GCGTGAGCCG GGTGGGGAGG GCTGCGCTGG CGGTCTCGCC AGTGGTGGTC
AAGGCGATGC TGGCCGGCAT CGGCGTGACC ATCATGGTCC AGCAGATCCA CGTCCTGCTG
GGCTCCGGCC CGGCAGGCTC CGCCATCGAA AACCTCGCCA ACCTTCCGGC GGCCATCACC
AACGTCGAGA TCCATGCGGC ACTGCTTGGA CTCACCGTGG TGATCATCCT GGTGGCCTGG
AAGCACCTGC CTGCCGCCGT ACGGAAAATC CCCGGCCCGC TGGCTGCCGT TGCTGCCGTC
ACCGCGTTGT CGGTGCCGCT GGCGCCGGCC GTGGAGCGGA TCTCCTTTTC GGGCTCCATC
CTGGACGCCG TGGCCCTGCC TGCACTGCCC GAAGGAAACT GGCGCGCCAT AGCCTTCGCT
GTCCTGTCAA TGGCGCTCAT TGCCAGTATC GAATCGCTCC TGTCAGCCGT CGCCGTGGAC
AAGATGCACT CCGGTCCGCG GACCAACCTC AACAAGGAGC TGATGGGCCA GGGAACGGCC
AACATCCTCT CCGGTGCCCT GGGCGGCCTG CCCGTCACGG GCGTCATTGT ACGCAGCGCC
ACGAACGTGG AAGCCGGTGC CAAGTCACGG ACTTCCGCCA TCCTGCACGG CGTGTGGATC
CTCATCTTCT CGGCGCTTTT CGCCGGCATC ATCCAGCTCA TTCCGCTGTC GGTGCTGGCG
GGGCTGCTGC TGGTCATCGG GGCCAAGCTG ATCAAGGTTG CGGACATCCG CACCAGCCGG
CGCACCGGTG ACCTGCTGAT CTACGTCGTG ACCCTCTTCT GCGTCGTTTT CCTCAATCTC
CTGGAGGGCG TGCTCATCGG CCTGGCGCTT GCCGCGGCCA GCGTGCTGTG GCGCGTCCTG
CGCGCGGCGA TCCGCGTACA CGAGCCGGTC TCGCCGTCGT CGGCCTGGCG TGTCACCATC
GCCGGATCGT GCAGCTTTTT CGCCCTGCCG CGCCTGAACC GCGTGCTGCA CTCGGTGCCC
GCAGGCAACA ACGTGGTGAT CGAACTCAAT GCCGATTACG TGGACCACGC CTTCCGTGAA
TCACTGGTGG CCTGGCGTGA CCAGTACCGT GCCGCAGGCG GCTCAGTCGA AGTTGAGGAA
CACGGGAACA CCCTGTTCCA GGACGCTGAG CACAGGGCAC CCCAGCGGCA GGAAGCCCGC
GAGCTTCCGC TGCCGCCGCG CAACTCCCGG ACTGCGGACG GCGAAGATGC CTCCAGCCAG
CTGGGCAGTG AACGGACCCC GGCAGTCCTG GCGGGAATCA GCAAGTACCA CCGCCGCTTC
GCTGACCAGG TCCGTCCGCT GGTGGAGGAC CTGGCGGAGC AGCAGCATCC GGATACACTT
TTCGTGGCGT GCGTCGATTC GAGGGTCAAC CCCAACCTGA TCACCAGTAG CGGCCCCGGC
GACCTGCTGA CCCTCCGCAA CATCGGCAAC GTGGTGTGCC ATGACGGCCA GGACGCCTCC
ATTGACTCGG CGCTTTCGTT CGCGGTCAAG GGCCTCGAGG TAAACACCAT CGTAGTGTGC
GGGCACTCGA ACTGCGGTGC CATGAAGGCC GTCATCGCCG ATGCCGAAGG TGCGGGGAAT
CCGGGACTGG GCACCGGATT CGACGCCTGG CTGGAGCATG CCCGTCCCAG CTACCTCGAG
CTGATGGCGG ACCACCCCGT GGCGCGGGCC GCTGCGGAGG CGGGCTACTG CCGCCTCGAC
CAGCTGGGCA TGGTCAACGT GGCTGTCCAG CTCAGCAAGC TCGACAACCA TCCGGTGGTT
GGTCCTGCCA TCGCCGCCGG GCAGGTCCAG GCCACCGGGC TTTTCTACGA CATCGCCACT
GCCCGGGTGG TCCTGGTGAC GCCTCACGGC ATCGAGTCCC TGGACCCGGC CCAAGCCCCG
GTGCAGGCCA CGGGCGCAGC AGCCCGCTGA
 
Protein sequence
MTPGPATTTA SNHSPPGGRR ERANRIRPFL SNLGADVPAS LVVFLVALPL SLGIAAASGA 
PIMAGLIAAA IGGIVAGSLG GAPLQVSGPA AGLTVIVAGL VQEFGWQATC AITAAAGVVQ
LLLGVSRVGR AALAVSPVVV KAMLAGIGVT IMVQQIHVLL GSGPAGSAIE NLANLPAAIT
NVEIHAALLG LTVVIILVAW KHLPAAVRKI PGPLAAVAAV TALSVPLAPA VERISFSGSI
LDAVALPALP EGNWRAIAFA VLSMALIASI ESLLSAVAVD KMHSGPRTNL NKELMGQGTA
NILSGALGGL PVTGVIVRSA TNVEAGAKSR TSAILHGVWI LIFSALFAGI IQLIPLSVLA
GLLLVIGAKL IKVADIRTSR RTGDLLIYVV TLFCVVFLNL LEGVLIGLAL AAASVLWRVL
RAAIRVHEPV SPSSAWRVTI AGSCSFFALP RLNRVLHSVP AGNNVVIELN ADYVDHAFRE
SLVAWRDQYR AAGGSVEVEE HGNTLFQDAE HRAPQRQEAR ELPLPPRNSR TADGEDASSQ
LGSERTPAVL AGISKYHRRF ADQVRPLVED LAEQQHPDTL FVACVDSRVN PNLITSSGPG
DLLTLRNIGN VVCHDGQDAS IDSALSFAVK GLEVNTIVVC GHSNCGAMKA VIADAEGAGN
PGLGTGFDAW LEHARPSYLE LMADHPVARA AAEAGYCRLD QLGMVNVAVQ LSKLDNHPVV
GPAIAAGQVQ ATGLFYDIAT ARVVLVTPHG IESLDPAQAP VQATGAAAR
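
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. A minimal sketch for stripping that layout and computing length and GC fraction follows; the helper names are hypothetical, and the example input is only the first displayed line of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed (six blocks of ten bases).
first_line = "ATGACGCCCG GCCCTGCCAC AACAACAGCC TCCAACCACA GCCCGCCCGG AGGGCGCCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the complete gene sequence, the same helpers should give a length and GC percentage comparable to the Gene Length and GC content fields, provided the text was copied without loss.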