Gene Arth_1212 details


Gene Information

Locus tag: Arth_1212
Symbol:
ID: 4446308
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1320179
End bp: 1321381
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 63%
IMG OID: 639689019
Product: transglycosylase domain-containing protein
Protein accession: YP_830706
Protein GI: 116669773
COG category: [S] Function unknown
COG ID: [COG3583] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
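
The coordinates and lengths in the table above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported 1203 bp) and that the final codon is a stop codon that contributes no residue. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1320179
END_BP = 1321381


def gene_length_bp(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by a CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
residues = protein_length_aa(length)
print(length, residues)  # 1203 400
```

Both results match the Gene Length and Protein Length fields.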


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0852875
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCGTTT TTCCCCGTGC CCGGATGGTC CTAGAGTTAC GGGCAATCGT GGTCAAGTTC 
TTCACTTCGG ACGGTAAGTT CAGTTTCGTC AAGGTCGGTG CCCAGCTGGT TGTGCTCTCT
GCACTCGTGC TGGGCCTGGT GGCCTTCGTA GGCAACAACA AAACAGTCAC CCTGAATGTG
GACGGGAAAG TCAGCTCCGT CCAGACGTTC GGCGGGACGG TAGGCCAAGT GGTCAAAAGT
GCCAAGGTGG AGCTGCAGGC CGCGGACCGG GTTTCCCCGT CGGCGGACGC CCGCGTGGAG
GATGGCTCGG TCATCAACGT CAATCTCGCC AAGGCAGTGA AGATCAGCCT CGACGGCGCT
GAGAAGACGA TCAACACAAC CTCTGCCAAC GTCGAAGGAC TGGTCACCGA ACTCGGCGTT
GCCAGTGCCT CGGAAGTCTC CGCGCCAAAG GACGCCCAGC TGGCCGTCTC CGGTTCGTTT
GTGGCCATCT CCACGCCCAA GACCGTCAGC ATCCTGGCGG ACGGCAAGGC GTCGAAGACA
ACCACCACGG CTTCAACCGT GGCGGAGGTC CTCAAGGACG CCGGAGTGAC CGTGGGTGCC
GGTGACCGGC TTTCCCAGCC GCGCAACGCG CACGTCGTCA ATGACATGGC GATCAAGGTC
TCCCGGGTGG ATTCCTCCAA GACTGCCGCA ACCTCCGAAG AGGTTCCCTT CGAGACCCTG
AGTTCCGAAA GCGCCGACCT GTTCGTCGGC GAGAAGAAGG TCACCCAGGC CGGTGTCCCC
GGCAAGGTGG ACAAGAACTT CAAGCTGGTG CTGGTGGATG GCCGGGAAGC CTCCCGGACC
CTCGTCTCCG AGACCGTCTC CGTCCAGCCG GTGACTGAAA AGGTCTCGGT CGGGACCAAG
GAAAAGCCCA AGGCCGAAGC TGCCGGTGCG AACACCGGTG CAGCCGCCCC CGCCATGATG
AATGAAGCCA TGTGGGACAA GATCGCGCAG TGCGAATCCA CCGGCAACTG GTCCATCAAC
TCCGGCAACG GCTACTACGG CGGTCTGCAG TTCGACATCC AGACCTGGCT CGGTGCCGGA
GGCGGCGCCT ACGCTCCCAA CGCCAGCCTT GCCACCAAGG CCCAGCAGAT CGACATCGCC
AACCGCGTTT ACGCGCAGCG CGGCCTCTCC CCCTGGGGCT GCGGCTGGGC AGCGACCAGC
TAA
 
Protein sequence
MCVFPRARMV LELRAIVVKF FTSDGKFSFV KVGAQLVVLS ALVLGLVAFV GNNKTVTLNV 
DGKVSSVQTF GGTVGQVVKS AKVELQAADR VSPSADARVE DGSVINVNLA KAVKISLDGA
EKTINTTSAN VEGLVTELGV ASASEVSAPK DAQLAVSGSF VAISTPKTVS ILADGKASKT
TTTASTVAEV LKDAGVTVGA GDRLSQPRNA HVVNDMAIKV SRVDSSKTAA TSEEVPFETL
SSESADLFVG EKKVTQAGVP GKVDKNFKLV LVDGREASRT LVSETVSVQP VTEKVSVGTK
EKPKAEAAGA NTGAAAPAMM NEAMWDKIAQ CESTGNWSIN SGNGYYGGLQ FDIQTWLGAG
GGAYAPNASL ATKAQQIDIA NRVYAQRGLS PWGCGWAATS
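
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not the database's own method, of how the blocks might be cleaned, the GC fraction computed, and the DNA translated with translation table 11. The helper names are hypothetical. Alternative start codons such as GTG or TTG are not treated specially here, which is acceptable for this gene because it starts with ATG.

```python
from itertools import product

# NCBI translation table 11 (bacterial, archaeal and plant plastid code).
# Its amino-acid assignments are the same as the standard code; codons are
# ordered T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGTGCGTTT TTCCCCGTGC CCGGATGGTC CTAGAGTTAC GGGCAATCGT GGTCAAGTTC"
print(translate(clean_sequence(first_line)))  # MCVFPRARMVLELRAIVVKF
```

The translated fragment agrees with the first 20 residues of the protein sequence listed above. Passing the full cleaned gene sequence to gc_fraction and translate should allow the same comparison against the GC content and Protein Length fields.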