Gene Achl_3056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3056 
Symbol 
ID7294536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp3398449 
End bp3400539 
Gene Length2091 bp 
Protein Length696 aa 
Translation table11 
GC content71% 
IMG OID643591466 
ProductLPXTG-motif cell wall anchor domain protein 
Protein accessionYP_002489106 
Protein GI220913797 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTTGG CGTCAGTGCG CAGGTACCTC TCCCTGGGTG CCGTAGCAGG CATCCTGGCG 
TCGTCGGCCT ACGTGGCTGC GCCTGCCTGG GCTGCCGATG GTCAGGCCGC AACCACCGAC
TCAACGGAGG CGCCGGCCGT TGGAACACCG GGTTCCAGGC TGTCGGAGGC CGCCCTCGAC
GAGGCGGTCC AGCGGGACCT CGGACTCACC CGGGAACAGT TCACCGCAGC CGGGGAGCTC
GGCGCCCAGG CGGCAGCCGC GGCCGCCCAG TTGCGCGACG TGCCCGGTTA CGCCGGGATC
CGGCTGGACG GCGGGCACGT GGTGGTCACA GGATCCGGGC CGGAGTTCCT CGCCAAAGTG
GCCGCGCTGG CTGAGGCCCT TCCGGACCTC GCCGTCGAGG CCCCGCAGGC CAGGCGTGCC
GACGGCGGCG TGGCACCGTC CGGCGGGGTA GCTGATGGCT CACAGCTGGC GGTCAGCACC
GACCAGCTCT TCCAGGCATA CGTGCGCGAC GTCGGCCCCG ACGGCCTGCA GGCAGTCATG
GCCGCGGACG GGAAATTCGT CATCCGCACC GGGGCGCTTA ACGTACCCGA ATCGGCCGGA
CAGGCCGGCG GGGTGGCGGC TGACGCCTCA AGGGACGACG CAGCAACCAC GCCAGGAACG
ACGGCGGCGG GCACGACGGC GGCCGGCGGC TCCGGCAGGG TCTCACCGGC TGATTTCGTG
GCCCGGTACG CCAACGTGGA GCTCGACGGT GCCGCTCCCC TCAGACCCGA GGCCGATGTT
CCCGGCGGCG TGGGCTACAT CGCCGATACC GGGTGGATCT GCTCCACCGG GTTCTCCGCC
TTCGACCCCG CGGGACTGCC TGCCGTTCTC AGCGCAGGGC ACTGCGCCTC GGACGGGCAG
GCGGCAACGG CAACCCTGGA GTTCCAGTTC GTCCGGAAGG ACCAGCTGGG CGACTTCGGA
TTCAGCCAGT TCGGCGGCCC CGGCAACTCC CGCATCATTG AGAGCCCTGT GGACCCCGAC
AACCCCGGAA ACGTGGGCAC GGACATCTCG GTGATCCACA ACATCCCGGA AGGCCTGGAT
CCGCTGCCCG CCGCCAGCAC CTGGGGTGAC CCTTCGCAGC CCGGCCCTGA TGTGAAGATC
ATCGGCACCG CCGATCCTGT CCTGGGGATG CCCATCTGCC GCTCCGGCCG GACCTCGGCC
TGGTCCTGCG GCACAGTGGA CGCTGTGGGT ATCTACGTGG TGCCAGGCCC GGACTTCGCA
ACTGACCCCA CCGACCTCCG GGCCTTCCGG GGATTCCTTT CCTACGGCGT CCAGTCCAGC
GGCGGAGACT CAGGCGGCCC CTACGTCAGC GGCAACTACG CTGTGGGAAC GCACGCCGCC
GGTGACGCGC CGCCCGCACC GGGTGAGCCG GCACCCGCAA ACTTCGCCGT GGGTGCCACC
CTCTCGGAAA GCCTCGCGGT CCTGCCCGGC TACCAGTTGG AACTGTTCCT CAACAAGCCT
GCCGTCAGCT CGCCCGCCCC CGGCGGAACC TACGAACCAG GGCAGGCCAT CAGCGGCAAT
GTCCCGGCAG CCCCCGCGTC CGCCGTTGCC GCAGGCTCCA CGGTCAGGAT CACGGTGGAG
GGCAAGGACC CGGTGGAGGT TCCTGTCGAC GCCGCGGGCA AGTGGAGCTT CACCGCACCG
GAAAGCAACG GCCCGCTCCG CTTCACGGCC GAGACGGTCA ACGGCTTCAG CACCTCGGGC
AGCGCCGAGT TTGAGTTTGC CCGGACCGCT GCGGAGCCGC CTGCACCGGA GCCGCCTGCC
CCCGGCAACC CGGGTCCCGT TGTTCCTGCC CCGGAACCGC CCGCCCCGGA ACAACCCAAT
CCCGTGCCCG CTGACCCGGC CCCGGCTGCA CCAGCCCCCG CCGCACCGCA GCCGCCCGCG
GCAGCCGGCA TCGTCGTCGT ACCTTTCGAT TCGGGGCCGG GCCCGGCCGA CCTGGCGAAC
ACCGGCTCCA CCGGGCTGGT TCCCGCAGCC CTGGCAGCGG CCGCGGCGCT CGCCGTCGGC
ATTATCCTGA CAGTCCTGGT GCGCCGCAGG GGCCGGCGTT CGGCGAAATA A
 
Protein sequence
MTLASVRRYL SLGAVAGILA SSAYVAAPAW AADGQAATTD STEAPAVGTP GSRLSEAALD 
EAVQRDLGLT REQFTAAGEL GAQAAAAAAQ LRDVPGYAGI RLDGGHVVVT GSGPEFLAKV
AALAEALPDL AVEAPQARRA DGGVAPSGGV ADGSQLAVST DQLFQAYVRD VGPDGLQAVM
AADGKFVIRT GALNVPESAG QAGGVAADAS RDDAATTPGT TAAGTTAAGG SGRVSPADFV
ARYANVELDG AAPLRPEADV PGGVGYIADT GWICSTGFSA FDPAGLPAVL SAGHCASDGQ
AATATLEFQF VRKDQLGDFG FSQFGGPGNS RIIESPVDPD NPGNVGTDIS VIHNIPEGLD
PLPAASTWGD PSQPGPDVKI IGTADPVLGM PICRSGRTSA WSCGTVDAVG IYVVPGPDFA
TDPTDLRAFR GFLSYGVQSS GGDSGGPYVS GNYAVGTHAA GDAPPAPGEP APANFAVGAT
LSESLAVLPG YQLELFLNKP AVSSPAPGGT YEPGQAISGN VPAAPASAVA AGSTVRITVE
GKDPVEVPVD AAGKWSFTAP ESNGPLRFTA ETVNGFSTSG SAEFEFARTA AEPPAPEPPA
PGNPGPVVPA PEPPAPEQPN PVPADPAPAA PAPAAPQPPA AAGIVVVPFD SGPGPADLAN
TGSTGLVPAA LAAAAALAVG IILTVLVRRR GRRSAK