Gene Arth_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3066 
Symbol 
ID4444265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3438408 
End bp3440084 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content65% 
IMG OID639690892 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_832545 
Protein GI116671612 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.479156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCTT ATACAGCCGG GGACACTGAC GTCCCGCTGC TTGACGAGAC AATCGGCCAG 
AATTTCGAGC GGATGGTGGC CAGGTTCCCG GTCCATGACG CCCTGATAGA GGCGGCTTTG
GTACCCGGAG CGGACGCCCG CCGCTGGAGC TACACCAAGC TGAACGACGA CGTCGACAGG
TTGGCCCGGG CGCTGATCGC CCGCGGCGTC GCCACCGGGG ACCGGATCGG CATCTGGAGT
CCCAACTGTG CGGAGTGGAC CATCCTGCAG TACGCCACGG CCAAGATCGG AGCGATCCTG
GTCAACGTGA ACCCGTCCTA CCGGAGCCAC GAACTGGAGT TCGTGGTCAA GCAGAACGGC
ATGAGGATGC TGGTGGCGGC GCCGTCGGAC CGGAGCAGCG ACTACACGGC GATGGCCAGG
CAGGCCCTCG CCGCATGCCC GGAGTTAAAG GAGCTGGTCT TCCTGCCCGA TGGCGGCCAG
CCGCAGCTGC AGGCGGGTGA TCCTGAAACT GCTGCGGAAA TGACTTACGC CGAGCTTCTC
AAGCGGGCGG ACGACGTCGG GCATTCCGTT CTGAAGGCCC GCCTGGCGGG GCTGGACCCG
CAGGACCCCA TCAACCTGCA GTACACGTCG GGCACCACAG GGTTCCCCAA AGGTGCCACC
CTGACCCACC GCAACATCCT GAACAACGGC TATTCCATCG GCGAGCTGCT GGGCTACACG
GAGCATGACC GCGTGGTGAT TCCGGTGCCG TTCTACCATT GCTTCGGGAT GGTGATCGGG
AACCTGAATG CCCTCAGCCA CGGCGCCGCG ACCATCATTC CGGGACGCAC CTTCACTCCG
GCGGCAGCGC TCGAAGCGGT CCAGGACTTT GGCGGTACAT CGCTGTACGG CGTGCCCACG
ATGTTCATTG CCGAGCTCGC GCTGCCGGAC TTCGCATCCT ATGACTTGTC CACGCTGCGC
ACCGGTGTGA TGGCCGGATC CCTGTGCCCT ATCGAGGTGA TGAACCGGGT CATCTCCGAC
ATGAACATGA AGGATGTGGC CATCTGCTAC GGCATGACGG AAACATCCCC GGTATCCACC
ATGACCCGCG CCGACGACAC CCTGCAGCAG CGCACTGAAA CCGTGGGACG TACCATGCCG
CAGCTGGAGA GCCAGGTGGT GGACCCGGCG ACAGGCGAGG TGCTGGAGCG GGGGGAAATC
GGCGAGCTGT GCACCCGCGG GTATGCCGTG ATGAAGGGAT ATTGGAACCA GCCGGACAAG
ACCGCCGAGG CGATCGACCC GGACGGCTGG ATGCACACCG GGGATCTTGC CCGGATGGAC
GCGGACGGCT ACGTGGTGAT CGAAGGCCGC ATCAAGGACA TGGTGATCCG CGGCGGCGAG
AACATCTACC CGCGGGAGAT CGAGGAGTTC CTGTACACCC ACCCGTCCAT CCAGGACGTG
CAAGTGATCG GGGTGCCGGA CGCCAAGTAC GGCGAGGAGC TTATGGCCTG CATCATCGTC
AAACCCGGCG CGGATCCGCT GGACGCGGCC GACGTCGCCG AGTTCTGCCG CGGCAAGCTG
GCGCATTACA AGATCCCGCG CTACGTGGAG GTCCGGGACA GCTTCCCGAT GACGGTGTCA
GGGAAGATCC GGAAAGTGGA GATGCGGCAG GAGGCCGTGG CCCGGCTGGG ACTGTAG
 
Protein sequence
MRAYTAGDTD VPLLDETIGQ NFERMVARFP VHDALIEAAL VPGADARRWS YTKLNDDVDR 
LARALIARGV ATGDRIGIWS PNCAEWTILQ YATAKIGAIL VNVNPSYRSH ELEFVVKQNG
MRMLVAAPSD RSSDYTAMAR QALAACPELK ELVFLPDGGQ PQLQAGDPET AAEMTYAELL
KRADDVGHSV LKARLAGLDP QDPINLQYTS GTTGFPKGAT LTHRNILNNG YSIGELLGYT
EHDRVVIPVP FYHCFGMVIG NLNALSHGAA TIIPGRTFTP AAALEAVQDF GGTSLYGVPT
MFIAELALPD FASYDLSTLR TGVMAGSLCP IEVMNRVISD MNMKDVAICY GMTETSPVST
MTRADDTLQQ RTETVGRTMP QLESQVVDPA TGEVLERGEI GELCTRGYAV MKGYWNQPDK
TAEAIDPDGW MHTGDLARMD ADGYVVIEGR IKDMVIRGGE NIYPREIEEF LYTHPSIQDV
QVIGVPDAKY GEELMACIIV KPGADPLDAA DVAEFCRGKL AHYKIPRYVE VRDSFPMTVS
GKIRKVEMRQ EAVARLGL