Gene Arth_4351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4351 
Symbol 
ID4443462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008538 
Strand
Start bp90490 
End bp92001 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content64% 
IMG OID639687672 
Producthypothetical protein 
Protein accessionYP_829369 
Protein GI116662315 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.646589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGACG CAATCTGGAG CGCGCTCGGC GAGGCCCTCA GCCCGATGTC GCTAATCATG 
CTGCTTATCG GCGTTGTGAT CGGCTTTCTC GTCGGCATCC TCCCGGGCAT CGGCGGTGCG
GTCACGCTCG CCCTGCTCAT CCCCGTGACC TTCGGCATGG ATCCGGTGCC GGCCTTCTCG
CTTCTGCTGG GCATGTATGT CGTCTCCGCG ATCGCCGGCG ACTTCACCTC AATCCTGTTC
GGGATCCCCG GTGAGCCAAC GGCAGCAGCC ATGGTGCTCG ACGGCTATCC GCTTAATAAG
AAGGGGCAGG CCGGGCGCGC GCTTGGCGCC TCCCTGTCAA GCTCGGCGAT AGGGTCGATC
TTCGGTGCGG TCCTGCTCAT GGCGCTGATC CCGGTGATGC GGCCGCTCAT CTTGAGCATC
AGCGCGCCCG AACTGTTCGC CGCGGCCGTG CTCGGCCTGA CGTTCATCGC GTCGCTCTCC
GGCGGTATCA TCCACAAGGG CCTGGTCATG GCCACGATCG GCGTGCTCCT CTCGCTCGTC
GGGCTCGATC CGAACCTCGG CATCGAGCGG TACACCTTCG GAAGCCTGCA CCTCTGGGAG
GGCATCGGCA TTGTCCCCGT CGTCGTCGGT CTCCTCGGCG GTGCCGAAGT GCTGCAGTCC
ATGCTCGACA AAGACGGTGA CACCAAGACG GCCGTGCCGC CACACCTTGG CGGCATCCTC
GTCGGCGTCA AGGATTCCTT TCGCCACTGG TCGTTGATGC TGCGCACCAG TGCCATCGGT
GCCGTCCTGG GAATGATGCC CGGTCTCGGA GGATCGGTGT CCCAGTTCAT CGCCTACGGC
CACGCCAAGC AAACCTCGAA GCATCCCGAG GAGTTCGGCA ACGGTTCGAT CGAGGGCGTG
ATCGCCGCAG GTGCGACCAC GACAGCGAAG GACGGGGGCC ACCTCGTACC GACGATCGCC
TTCGGCGTGC CGTCGGGCGC GAGCATGGCG GTGCTACTCG GCGCGTTCCT GATCCTCGGC
CTGAATCCTG GCCCCGAGAT GCTGGGCGAG CACCTCAACG TGACGCTCTC GCTCGTGTGG
ATCATCGTGC TGAGCACGAT CGCGGCTGTC ATCCTCGGCT ACCTGCTGAT CCGTCCCCTT
GCCAAGCTCA CCTCGGTGAC CGGAAGGCTG CTCGTACCGT TTCTCATCAC GATGCTCACC
ATCGGCGCAT TCTCGAACAC CAGCTCGCTC GACGACGTCT GGATCATGCT CGTCTTCCTC
GCCATCGGCG TGATGTGCAG CCGCTTCAAG TGGCCCCGCA TCCCGCTGCT GCTTGGTCTG
GTGCTCGGCG AGATCCTCGA GCGGTACTTC ACCGTCAGCT ACGCGCTGTT CCAGTTCAAC
TGGCTCAGCC GGCCCGGCGT GATCGTCATC GAGGTCATCA TCGCCGGCAT GATCATGTTC
ACCGCGGTCC GCACCGCTCG CAAAAAGCGC GCCGATAAGC GCGAGCTCCT GGCCACCGGA
GGTCTCGTAT GA
 
Protein sequence
MLDAIWSALG EALSPMSLIM LLIGVVIGFL VGILPGIGGA VTLALLIPVT FGMDPVPAFS 
LLLGMYVVSA IAGDFTSILF GIPGEPTAAA MVLDGYPLNK KGQAGRALGA SLSSSAIGSI
FGAVLLMALI PVMRPLILSI SAPELFAAAV LGLTFIASLS GGIIHKGLVM ATIGVLLSLV
GLDPNLGIER YTFGSLHLWE GIGIVPVVVG LLGGAEVLQS MLDKDGDTKT AVPPHLGGIL
VGVKDSFRHW SLMLRTSAIG AVLGMMPGLG GSVSQFIAYG HAKQTSKHPE EFGNGSIEGV
IAAGATTTAK DGGHLVPTIA FGVPSGASMA VLLGAFLILG LNPGPEMLGE HLNVTLSLVW
IIVLSTIAAV ILGYLLIRPL AKLTSVTGRL LVPFLITMLT IGAFSNTSSL DDVWIMLVFL
AIGVMCSRFK WPRIPLLLGL VLGEILERYF TVSYALFQFN WLSRPGVIVI EVIIAGMIMF
TAVRTARKKR ADKRELLATG GLV