Gene Arth_1208 details


Gene Information

Locus tag: Arth_1208
Symbol:
ID: 4446304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1313854
End bp: 1316349
Gene Length: 2496 bp
Protein Length: 831 aa
Translation table: 11
GC content: 69%
IMG OID: 639689015
Product: transglutaminase domain-containing protein
Protein accession: YP_830702
Protein GI: 116669769
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
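
The coordinates and lengths listed in this record are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and do not come from the database.

```python
# Values copied from the gene record.
start_bp = 1313854
end_bp = 1316349

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 2496
print(protein_length)  # 831
```

Both results agree with the Gene Length (2496 bp) and Protein Length (831 aa) fields.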


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTTGA CTCCCCAGCG ACATCCCCCT CAGGAGACGG ACCGCACGCC TGAGGTCCTT 
GCCCCGTTAG CCGCACCGGC CGCGCGGCTG CGCCGCGCTG GTGCGCAGCC GTGGGCCATG
GCGGCTGCCG TCGCCCTCGC TGTGTCCGGA GCCGCCCTAT CCCTTAACGG CGTACTGCGG
GGCTGGGCGT GGTATCCGCC GGTGCTTACC ACCGTCTGCG TTGTTGCCCT AACCATGGCC
CTCCTGAGGT CGCTGCGCGC GCAGCCATTC CTCGTGGCGC TGGGCGGCTT TGTGTCCCTC
GCTTTCATCC TGACTTTCAC GTTCTTCCGC CGGGACAGCA TCGCCGGGTT CATTCCGTCC
GGGGACACCC TGGAACAGCT GGGGAGGTAC CTGCGGCGGG CAAGTGAAAC CGTCCTGGCC
GAGAGCTCCC CGGTTGCGCC GAATGCCGGC ATAGTGCTGG TCACGTGCGC GGTCCTGGGC
CTCGTGGTGA TCCTGGTCGA TGCCCTCGCA GTGCCGCTGG CCCTGCCGGC AACTAGCGGG
CTGGGACTGG TGGCCATCCT GGTGGTGCCC GCCATGGTCA AACCGCAGAG CGTCGGGGTG
TGGGGGTTTG CCGCAGCAGC CGCCGGCTAC CTGATGATCC TTGCCTGCAG CCAGTGGTTC
GCCCCGGACA ACCGCACTAC GGCGGACACC GCCCGCACTC CGGGACAGTT CCGCCGCGCT
GCCCTCACCG GCGGCATCGC GCTGGTCATC ACGCTCCTCC TTCCCCTGGC CGTTCCGGGT
TTCGACCAGG GCACGTTCCC ACAGGGTTCC AGGCTGAATC CTTGGGGTTC ATCCACCGGG
CTCAATCCGA TGATCAGCCT GGGCAACAGC CTGCGGAGTC CCTCCGGCGA CGGACGCATC
ACTTATGCCA CCAGTGCCAG CAGGCCCCCG TACCTGAGGT CCGTCACGGT GGACCGGTTC
GACGGCGACT CCTGGTCACC GGACGACCGC GACGCTTCCC GGCGCGTGGG GGTCGGAAGG
ATGGACGCCG GGTACGAGAT CCTCGCGGAC GAGCAGGTCC GCCAGGTCAC GGCGGTCAAT
ACCGGCCGTT TCACCAGCCC GTACCTGCCG GTGCCGTACG CCCCGGATGC CGTCAACGGC
CTGGCCGGAC GATGGAGCTG GGATCCGGCC ACGCTCAGCA TCAAGGGGGT GGACACCAAT
TCACGGAACC AGCAGTACGT CGTGCAGTCG ACTGCACCCA GGCTCACGCC GGAACTGTTG
TCACAGTCCT CGGCACCGGT GCGCGGGATC TCCGAGGAGT TCCTCCGGGC GCCCAACAAC
GTGCCGGACA TTGTTCGGAG CACGGCGGAT AAAGTGACCG CCGGAAGCAG TACGGCGTAT
GCCAAGGCGA TGGCCATCCA GAACTACCTG CGCTCGCTGG AGTTCACGTA TTCCTTGCAG
TCCCCGGTCC AGGGCGGCTA CGACGGCAAC GGACTTTCCG TACTCGCGGA CTTCCTGACG
CAAAAGAGCG GCTACTGCAT CCATTTCGCT TCGGCCATGG CCGTGATGGC CAGGCTGGAG
GGAATCCCAA GCCGGATCGC CGTGGGCTAC GCCCCGGGCC GGTCCACGGG TGCCACGGTT
TCGGTTCCCG GGCAAGGTGC ACTGACGGAA TACGAAGTCG ACTCCAGGGA CGCGCATGCC
TGGCCCGAAC TCTATTTTCA AGGACTCGGA TGGGTGGCAT TCGAACCCAC GCCTTCACGC
GGCGTGGTGC CGGACTACGC GTCGCAGACC TCCACGCCGG GGGGTGCCAG CACGAACGAA
AACAACGACG ACCTCATCCC GTCGAACGCC GCAACGCCCG GCGCAACGCC AGGCGCCACC
CCGACGCCGC TGCCGGGGTT GGAATCAGCC GGTGAACCAG GGCACCCGCT GACTCTGCCG
CTCTACGGCA CCGCCGCTGT ACTCCTCCTT GCCCTGATCG CCGGGTCGCC GCGGATGGTG
CGGGCCGGCC TGCGGTCGCG GCGGCTCAGG AGCGTGCCGG CCCTCGGGGG CAACACGGCC
GCAGCGGCCT GGGCCGAACT GCGGGACCTG GCTACCGACT ACGGGCTGGC TCCCGGCCCC
AGCGAAACCC CGCGTCACTT TTCCAGCCGG CTCAGTGGAT CGGGTGCGCT TGGCGCGCGC
GACAACGCCG GAATCCGGGC TGTCGCCGCA CTCACCACGG ACTTCGAACG TCAGCAGTAC
GGGCCGCCTG CCGGCCGTGC CGGGGGCGCT GAAGACGTCG CGGCGCCTTC AGCAGCAGAG
AGGATAGCCG TGATCCAGGC ATCCCTGCGA AGGCATTCCC GCCGGGGACG GCAGCTGCGG
GCGGAGTGGC TGCCGCCGTC GGTCATGAAT CGCTGGGGGC GCATCGTTTC CGCGCCCTTC
CGTGCCGTCC TGCGGGCCGC CGCCAAACCC GCGCAGGCCG CCGCGCGGTC CTGGCGGACG
GTGCGCGACG GCCTGCGGCG GCTGCGACGG GGCTAG
 
Protein sequence
MTLTPQRHPP QETDRTPEVL APLAAPAARL RRAGAQPWAM AAAVALAVSG AALSLNGVLR 
GWAWYPPVLT TVCVVALTMA LLRSLRAQPF LVALGGFVSL AFILTFTFFR RDSIAGFIPS
GDTLEQLGRY LRRASETVLA ESSPVAPNAG IVLVTCAVLG LVVILVDALA VPLALPATSG
LGLVAILVVP AMVKPQSVGV WGFAAAAAGY LMILACSQWF APDNRTTADT ARTPGQFRRA
ALTGGIALVI TLLLPLAVPG FDQGTFPQGS RLNPWGSSTG LNPMISLGNS LRSPSGDGRI
TYATSASRPP YLRSVTVDRF DGDSWSPDDR DASRRVGVGR MDAGYEILAD EQVRQVTAVN
TGRFTSPYLP VPYAPDAVNG LAGRWSWDPA TLSIKGVDTN SRNQQYVVQS TAPRLTPELL
SQSSAPVRGI SEEFLRAPNN VPDIVRSTAD KVTAGSSTAY AKAMAIQNYL RSLEFTYSLQ
SPVQGGYDGN GLSVLADFLT QKSGYCIHFA SAMAVMARLE GIPSRIAVGY APGRSTGATV
SVPGQGALTE YEVDSRDAHA WPELYFQGLG WVAFEPTPSR GVVPDYASQT STPGGASTNE
NNDDLIPSNA ATPGATPGAT PTPLPGLESA GEPGHPLTLP LYGTAAVLLL ALIAGSPRMV
RAGLRSRRLR SVPALGGNTA AAAWAELRDL ATDYGLAPGP SETPRHFSSR LSGSGALGAR
DNAGIRAVAA LTTDFERQQY GPPAGRAGGA EDVAAPSAAE RIAVIQASLR RHSRRGRQLR
AEWLPPSVMN RWGRIVSAPF RAVLRAAAKP AQAAARSWRT VRDGLRRLRR G