Gene Achl_1110 details

Gene Information

Locus tag: Achl_1110
Symbol:
ID: 7292555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 1222755
End bp: 1223996
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 69%
IMG OID: 643589516
Product: hypothetical protein
Protein accession: YP_002487191
Protein GI: 220911882
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4692] Predicted neuraminidase (sialidase)
TIGRFAM ID:
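
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS length that includes the stop codon; both assumptions fit the values listed here, but the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1222755, 1223996)
print(length, protein_length(length))  # 1242 413
```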


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCA CCACGGACAG CTACAGCATC ATCACCCCGG ACGGGACGGT CAAGACGGCC 
GACGGCGCCG ACTTCGCCTA CCTGCCCGCG CCCACCGTGC AGAGCCACGC GGCCAACCTG
CTCACCCTCC CGGACGGCCG GCTGGGCTGC GTCTGGTTCG GCGGCACCCA GGAAGGCGTG
CCGGACATCT CCATCTGGTT CTCCGCCCTG GAACCGGGCA GCAAGCAGTG GTCCGCCCCG
GAGCAGCTCT CGGACGACTC CACCCGGTCC GAGCAGAACC CCATCCTGTT CACGAATACT
GACGGCGCTC TCTGGCTGCT GTACACCGCG CAGAAGGCGG GCAACCAGGA CACGGCCGAG
GTCCGCCGTC GTATTTCCCT GGACAGCGGC CGCACCTGGG GCGACGTGGA GACCCTGTTT
GCCGCCAACG AGACCGGCGG CGTGTTCGTC CGCCAGCTGC CCGTGGTGCT GCCGTCCGGC
CGGCTGATCA TCCCGATCTT CCGCTGCATC ACCACACCGG GGGAGAAGTG GGTGGGCAAC
AGCGACGACA GCGCCGTGAT GATCTCCGAC GACGCCGGCG CCACTTGGAC CGAAACCGTC
CTGCCGGGCA GCCTGGGCTG CGTCCACATG AACATCCAGC CCGTGGCCGA CGGCTCGCTG
CTGGCCCTGT TCCGCAGCCG CTGGGCCGAC TCGATCTACG AATCCCGCTC CACCGATGAC
GGGTCCACCT GGAGCGAACC GGTCCCCACC GAGCTGCCCA ACAACAACTC GTCCATCCAG
TTCGTCGCGT TGAAGGACGG CCGCCTGGCC CTGGTCTACA ACCACAGCCG GGCGGGCGAA
GGCACCGAGC GGCGCCTCTC GCTGTATGAC GAGATCGACG ACGACGGCCT GGCCGACGAA
CAGGGGCAGG TGGCCGAACC GGACGCCTCG GCTTTCTCCG AGGACGACGG CGTGAAGCGC
GCCTTCTGGG GCACGCCGCG CTCGCCGATG ACCCTGGCCA TCTCCGAGGA CTCCGGCCGC
AGCTGGCCCA TCCGCCGGAA CCTGGACGTG GGGGACGGGT ACTGCCTGTC CAACAACTCA
CGCGACGGGC TCAACCGCGA ATACTCCTAC CCGTCCATCC ACCAGGGCCC GGACGGCGCG
CTGAACATCG CGTACACGTA CTTCCGGCAG GCCATCAAGT TCGTCCGGGT TGACCCGCAG
TGGGCCTACG ACGGCACCAC CACGCCGGGC GGGGACGCAT GA
 
Protein sequence
MTTTTDSYSI ITPDGTVKTA DGADFAYLPA PTVQSHAANL LTLPDGRLGC VWFGGTQEGV 
PDISIWFSAL EPGSKQWSAP EQLSDDSTRS EQNPILFTNT DGALWLLYTA QKAGNQDTAE
VRRRISLDSG RTWGDVETLF AANETGGVFV RQLPVVLPSG RLIIPIFRCI TTPGEKWVGN
SDDSAVMISD DAGATWTETV LPGSLGCVHM NIQPVADGSL LALFRSRWAD SIYESRSTDD
GSTWSEPVPT ELPNNNSSIQ FVALKDGRLA LVYNHSRAGE GTERRLSLYD EIDDDGLADE
QGQVAEPDAS AFSEDDGVKR AFWGTPRSPM TLAISEDSGR SWPIRRNLDV GDGYCLSNNS
RDGLNREYSY PSIHQGPDGA LNIAYTYFRQ AIKFVRVDPQ WAYDGTTTPG GDA
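
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The following is a minimal sketch of how the listed GC content and protein sequence could be checked against the gene sequence. It is not the database's own method, and the helper names are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; the same
# assignments apply to translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence, copied from the listing above.
print(translate("ATGACCACCA CCACGGACAG CTACAGCATC"))  # MTTTTDSYSI
```

Passing the full gene sequence to `gc_fraction` and `translate` should allow a comparison with the GC content and protein sequence given in this record.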