Gene Arth_3862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3862 
Symbol 
ID4447561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4344622 
End bp4345848 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content68% 
IMG OID639691686 
Productcarboxylate-amine ligase 
Protein accessionYP_833337 
Protein GI116672404 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAACTT TCGGGGTTGA GGAAGAGCTG CTGATTGTGG ACCCCGTGAC CGGGGAGCCG 
CTGGCACTGG CGGACGCCCT GCTGACAGGG CGGAAGCTTG CTGCGGACGA TGCTCCGGAC
AAACCCCGGA TCCTGGACCC CCACGATCCA ACCCGTGACG ACGGCGACAC CGGGCTCACT
GCCGAACTGA AACTTGAACA GATCGAGACG CAGACCCGTC CGTGTCTGAA CTATGAGGAG
CTGCTCCTCC AGATCCGCCA GGGCCGGGCC CTGGCAGATA CCGCCGCGGA GAAACACAAT
GCGCGGGTGG CCGCGCTGGC AACATCGCCG ATTGCCTCCA CGACGCACAC CACACCGAAC
CCCCGCTATG CCACCATGCA GGAACGCTTT GGCCTCACCG TCCATGAGCA GCTGACCTGC
GGTTTCCATG TCCACACCTT CGTCGAATCC CCGGAAGAAG GCGTGGCTGT CATCGACCGG
CTCAGGGACA AGCTGGCGGT GCTCACGGCG CTCAGCGCAA ATTCGCCGTA CTGGAACGGC
GTGGAGACCG GCTTCGAGAG TTACCGCACG CAGGCCTGGA ACCGCTGGCC GACGTCGGGC
CCGTCCCAGA TCTTCGGGAC GCACTCCATG TACCGCCGCG TGGTCACCCG GCTGCTGGAC
AGCGGCGTGC TGCTGGACGA GGGCATGATC TATTTTGATG CGAGGCTCTC CCGGAACCAC
CCCACCGTGG AAGTCCGGGT GGCGGACGTT TGCCTGCAGG CCGAGGACGC CGCCCTGATC
GCCGTGCTGG TGCGGGCGCT GGTGGAATCG GCCAGCAGGG AATGGCGGGC CGGTGTAGAC
CCCGCGCCCG TGCCGACGGT GCTCCTGCGG ATGGCCGCGT GGCAGGCAAG CAACTGCGGA
CTCCGGGGAG ACCTTCTGGA TTTCGGCACG TTCCGCCCCG CTCCCGCCGA GGAAGTCGTG
GAGGCGCTGG TGGACTACGT CGCGCCCGTC CTGGCGGAAC AGGACGAGCT GGAACTGGCC
TGGGAAGGCG TGCGGAGGAT CCTGGACCGG GGGACCGGTT CGGAACAGCA GCGGCTTGCC
ATGCAGGAAT GCCTTGCCGG GAACCCGGAG GCCGCCGCCG GGCTGGCCGC CGTGGTTGCC
CACGCGGTGG ACGTGAGCAT GCGCCGGACC GAAGCCGTCA CCGCGCGCGA GAAGGCGCCG
GTGCTGCTGC GCGTTCGCCA GTCCTGA
 
Protein sequence
MRTFGVEEEL LIVDPVTGEP LALADALLTG RKLAADDAPD KPRILDPHDP TRDDGDTGLT 
AELKLEQIET QTRPCLNYEE LLLQIRQGRA LADTAAEKHN ARVAALATSP IASTTHTTPN
PRYATMQERF GLTVHEQLTC GFHVHTFVES PEEGVAVIDR LRDKLAVLTA LSANSPYWNG
VETGFESYRT QAWNRWPTSG PSQIFGTHSM YRRVVTRLLD SGVLLDEGMI YFDARLSRNH
PTVEVRVADV CLQAEDAALI AVLVRALVES ASREWRAGVD PAPVPTVLLR MAAWQASNCG
LRGDLLDFGT FRPAPAEEVV EALVDYVAPV LAEQDELELA WEGVRRILDR GTGSEQQRLA
MQECLAGNPE AAAGLAAVVA HAVDVSMRRT EAVTAREKAP VLLRVRQS