Gene Arth_0195 details


Gene Information

Locus tag: Arth_0195
Symbol:
ID: 4447353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 202844
End bp: 204019
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 64%
IMG OID: 639687990
Product: fatty acid desaturase
Protein accession: YP_829696
Protein GI: 116668763
COG category: [I] Lipid transport and metabolism
COG ID: [COG3239] Fatty acid desaturase
TIGRFAM ID:
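
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 202844
end_bp = 204019
reported_gene_length = 1176    # bp
reported_protein_length = 391  # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("protein length (aa):", protein_length)
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.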


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.670555
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAATAG TTACCAACCA CGAAGCCGAG ACCACGGAAG ACGCCGTCGT GCCCGCGAAG 
ACGCGGCGAG GTGCGCTGGC CGCGTCCGGC AGCCCCCTGA TCCGTCCGCC GGCCGCAGCA
CACCTCTCCG ACGAGCAGGT GGCTGAGCTG GGCCGCGAAC TTGATGCCAT CAAGGACGAC
ATTCTGGCGA AGCGCGGCGC CTCCGACGCC GCGTACATCC GACGGATGAT CAAGATCCAG
CGCGGGCTGG AGATCTCCGG CCGCGCGACG CTCCTGGTGA GCCGGAACAA GGCTGCCTGG
ATTACCGGCA CCACCCTGCT CAGCCTGGCC AAGATCCTGG AAAACATGGA GATCGGGCAC
AATGTGCTGC ACGGCCAGTG GGACTGGATG CGGGACCCGG ACATCCACTC CACCACCTGG
GAATGGGACT TTGTCACCCC GGCACGCGCC TGGCAGCACA CCCACAATGA CCTGCACCAC
CGCTGGACCA ACGTGGTGGG CAAGGACAAC GACGTCGGAT ACAACCTGCT GCGGATGGAC
CCCCAGCAGG AGTGGAAGCC GTTCAACCTC GGCAACCCGC TGTATAACGC CATCCTGGCA
CCGGTCTTCG AGTGGGGCAT CGCGATCTAC GACCTTGAGC TCCAGGACTA CAAGGAAGGC
AAGAAGTCCA AGGAAGCGCT GGTCAAGGAC CTCAAGGCCC TGGGCGTCAA GGCGCTCAAG
CAGTTCACCA AGGATTACGC CGCCACTCCC GCCGTCGCGA TGCTGACCGG CTCGGGCAAG
CAGGCGCTCT ACGGCACGCT GACCGCCAAT GCGGTGCGCA ACGTCTGGGC CCACGCGGTG
ATCTTCTGCG GGCACTTCCC CGAGGGGACG GACACGTTCA CCGAGGAAAT GGTGGAAGGG
GAGACCCGCG GGGACTGGTA CGTGCGCCAG ATGATCGGCT CAGCCAACAT CTCCGGTTCC
AAGTTCATGC ACCTCATGAC CGGAAACCTT TCGCACCAGA TTGAGCACCA CCTCTTCCCG
GACATTCCGT CCAACCGCTA TGCCGAGGTG GCGCCCAAGG TGCAGGAGAT CTGCAAGCGC
TACGGCCTGC CGTACACCAC GGGCCCGATC TGGAAGCAGG TCGGCTCCAC GTGGGCCAAG
GTCTTCAAGC TGGCGCTGCC GCCCAGGAAG GCCTAA
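
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how a GC fraction could be computed from such text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here, so the result is the GC fraction of that fragment and not the whole-gene GC content reported in the record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from block-formatted sequence text."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = "ATGGCAATAG TTACCAACCA CGAAGCCGAG ACCACGGAAG ACGCCGTCGT GCCCGCGAAG"

fragment = clean_sequence(first_line)
fragment_gc = gc_fraction(fragment)
print(len(fragment), "bases, GC fraction:", fragment_gc)
```

Passing the full gene sequence text to the same two functions would give the whole-gene value to compare against the GC content field.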
 
Protein sequence
MAIVTNHEAE TTEDAVVPAK TRRGALAASG SPLIRPPAAA HLSDEQVAEL GRELDAIKDD 
ILAKRGASDA AYIRRMIKIQ RGLEISGRAT LLVSRNKAAW ITGTTLLSLA KILENMEIGH
NVLHGQWDWM RDPDIHSTTW EWDFVTPARA WQHTHNDLHH RWTNVVGKDN DVGYNLLRMD
PQQEWKPFNL GNPLYNAILA PVFEWGIAIY DLELQDYKEG KKSKEALVKD LKALGVKALK
QFTKDYAATP AVAMLTGSGK QALYGTLTAN AVRNVWAHAV IFCGHFPEGT DTFTEEMVEG
ETRGDWYVRQ MIGSANISGS KFMHLMTGNL SHQIEHHLFP DIPSNRYAEV APKVQEICKR
YGLPYTTGPI WKQVGSTWAK VFKLALPPRK A
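
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first line of the gene sequence only. It builds the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). The function name `translate` and the variable names are hypothetical; this is not the tool the database used.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the
# protein sequence, both copied from the record with spaces removed.
dna_start = "".join(
    "ATGGCAATAG TTACCAACCA CGAAGCCGAG ACCACGGAAG ACGCCGTCGT GCCCGCGAAG".split()
)
protein_start = "".join("MAIVTNHEAE TTEDAVVPAK".split())

print(translate(dna_start))
print(translate(dna_start) == protein_start)
```

The same function applied to the last 36 bases of the gene sequence stops at the terminal TAA codon, which is why the 1176 bp gene yields 391 residues.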