Gene Arth_3663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3663 
Symbol 
ID4443664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4117975 
End bp4119015 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content66% 
IMG OID639691487 
Producthypothetical protein 
Protein accessionYP_833138 
Protein GI116672205 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAT ACCTTCCCGG CATCATCTGG CTGGTGGTGC TCCTGGTGGT CAACGCCTTC 
TTCGTCGGGG CCGAGTTCGC TGTCATCTCC GCCCGCCGGT CGCAGGTCGA GCCCAAGGCC
GAGGCCGGCA GCAAGGCCGC GAAGACCACG CTGTGGGCCA TGGAGCATGC CACGCTGATG
CTGGCCACCA GCCAGCTGGG CATCACCGTG TGCTCGCTCG TGATCCTGAA CGTTTCCGAA
CCCGCCATCC ACCACCTGCT GGAAATCCCC CTGGGCCTGA CCTCGCTCTC CGGCGAAGCG
ATCGGGATCA TCGCCTTCGT GGCCGCGCTG TTGCTGGTGA CCTTCCTGCA CGTGGTCATC
GGTGAAATGG TGCCCAAGAA CATCTCGTTC TCCGTTCCCA CCCGGGCCGC GCTCATCCTT
GCCCCGCCGC TGGTGATGGT GTCACGCTTG TTCAAGCCGG TGATCTGGAC CCTTAACGGG
ATCGCAAACT CCATCCTGCG GCTCTTCAAG GTCCAGCCCA AGGATGAGGC TACCAGCGCC
TACACCCTGG ACGAGGTGGC CAACATCGTG GAGCAGTCCA CCCGGGACGG CATGCTCACG
GACACCACCG GCACGCTGAA CGCAGCGTTC GAATTCACCG CCAAGACCGT GGCGGACGTG
GAAGTGCCGA TCAGCGAGAT GGTGCTCCTG CCGGCCTCGT CGACGCCGGC GGACATCCAG
AGCGCGGTGG CCCGGCACGG GTTCTCCCGC TACATCCTGA CGGACGACGA CGGCGTGCCC
TCCGGCTATC TGCACCTCAA GGACGTCATG GACCTGACGT CCCCGGAAAA ATTCGCCAGG
CCCGTGCCGG CCAAGAGAAT CCGACGGCTC GCCTCCGCGT TCAGCGGCAG CGACCTCGAG
GACGCGCTGG CCACCATGCG CCGCACCGGC GCCCACGTGG CCCGGGTCTT CGACGCGGAC
GGGAAGACCA CCGGCGTCCT CTTCCTGGAG GACATCATCG AAGAGCTGGT GGGCGAAGTG
CAGGACGCCA CGAGCGCCTA G
 
Protein sequence
MSEYLPGIIW LVVLLVVNAF FVGAEFAVIS ARRSQVEPKA EAGSKAAKTT LWAMEHATLM 
LATSQLGITV CSLVILNVSE PAIHHLLEIP LGLTSLSGEA IGIIAFVAAL LLVTFLHVVI
GEMVPKNISF SVPTRAALIL APPLVMVSRL FKPVIWTLNG IANSILRLFK VQPKDEATSA
YTLDEVANIV EQSTRDGMLT DTTGTLNAAF EFTAKTVADV EVPISEMVLL PASSTPADIQ
SAVARHGFSR YILTDDDGVP SGYLHLKDVM DLTSPEKFAR PVPAKRIRRL ASAFSGSDLE
DALATMRRTG AHVARVFDAD GKTTGVLFLE DIIEELVGEV QDATSA