Gene Arth_1238 details

Gene Information

Locus tag: Arth_1238
Symbol:
ID: 4446267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1361427
End bp: 1362956
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 67%
IMG OID: 639689046
Product: hypothetical protein
Protein accession: YP_830732
Protein GI: 116669799
COG category: [S] Function unknown
COG ID: [COG2327] Uncharacterized conserved protein
TIGRFAM ID:
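
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The Python sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1361427, 1362956)
print(length, protein_length(length))  # prints "1530 509"
```

Under these assumptions the result matches the 1530 bp gene length and 509 aa protein length given in the record.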


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.235902
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGTTCG TGGTGCAGGG CGATCTCGGC CAGGGTGCAT ACCACATTGG CGATGAGGCC 
ATGACACTGG CGGCAGTGGA TGAACTTTCC CGGCGCACCG GGGCCTCCTT TGTGCTGATG
TCCCGAGATC CGGAACAGAC CACGGAGCTT TACGGCACAG GGTCCATGGC CACCATCGAA
TTCCCGTGGC CGCCCGCCGA GCGGAGTGCT TACCTGGAAC TCGTACGCCG GGCCGCCAAG
GGAGACCGGA CTGCCCTCCC CGCTACCGAC CACGTGTGGG CCGTCATCGA GGAGATCTCG
ACGGCGGACG GCGTCCTGAT TGCCGGCGGC GGCAACATGA ATTCCCTCTA CGGCTGGCTG
TTGTACGAGC GTGCCGCCGT CGGCATCATT GCCAGGGAAC TCGGCAAGCC GCTGGTGGTC
AGCGGACAGA CGTTCGGGCC TACGCTGCTG CCGCAGGACC GGAAAATACT TCACGAACTC
CTCGACAGCG CCGCACTCGT CGGCGCCCGC GAACCCGTAT CCTACGCCCT GGGATTCGAA
TTGGGCCTCG GCTCGGACAA GCTGGTCCGC GTCCTGGATG ACGGCAGCTT CCTGCGCTCG
CAGACGGACC CGGCACCGGC GGGCGCAGAC GGCGGCCTGC CGGAGCTGCC GGCCGACGGT
TACATCGCGG CCACCGTGGG CCCGGACGCA TGGCGCGAAG GAACGCGCAC CCTCGCCCCG
TTCGCAGCCG TGCTGGACCG CGCGGCAGAG GTCACCGGCC TCCCGGTGTA TCTGCTGCCC
CACATGGGCA CGCTTGGATC CTCGGACAGC GGGGGCGACC ACGACTCCCA CCGGACCGTG
CTGGCGCACT CGCGGTCGGG AAAGCTGAAG ATGCTGCCCG TGCTGCCGGT GCGCACGGCG
GTGGCCGTCA CCGCCGGCGC CCGCCTGGTG GTCACCAACC GTTACCACCC TGCCGTTTTC
GGGCTGGCTG CAGGTGTACC CGTCGTCTCC CTGGCCAATG ATGCCTACTC GGACATCCGT
CTTTCCGGTG CACTAGGAAA CTGGGGTCTG GGCGATTGGG CACTGCCGCA GCCAAGCCTT
TCCCCCGGCG GTGTGGAGGA GGCCGTTGCG GAGGCCTGGC GGCGGCGAGA CGAGATCGGG
CAGCACCTTG CACGGCTGCG GCCCGGATTC GAGCGTTCCC AGGCGACATG GTGGGATGCC
GTGGCCGAGG TCCTGCGCGG CGTCGGAACC GACGACGAAC CTGGCACCCG CTACCGGGGA
CTTGACGAGG CGCCGCCGCT AAGCGCGGCG GAAACCTGGT CCCGCCAGGC TACGGAGCAG
CGGGCGCTGT TCCGCACCTT GAGCCTGGAA ATCGGACGGC AGTGGACGGC ATGGGACGAT
GTGCGGTCCC AACGCGATGT GCTGATCCAT GAACGGGATG AAGCACTACG GGAAAAGGAC
AGGATCCTGC AGTCACGAAC TTTCAAAGCA GCAAGAATAT TCGGCCGCGG CGCGGACTTT
GCGCGCCAGC TGACCGGAAG GAAACACTGA
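
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation. As a minimal sketch, the helpers below (hypothetical names) strip that formatting and compute GC content as the share of G and C bases. Only the first line of the sequence is used here as a stand-in, so the printed percentage is not expected to equal the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small stand-in for the full record.
first_line = "GTGAAGTTCG TGGTGCAGGG CGATCTCGGC CAGGGTGCAT ACCACATTGG CGATGAGGCC "
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives the value to compare with the GC content field.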
 
Protein sequence
MKFVVQGDLG QGAYHIGDEA MTLAAVDELS RRTGASFVLM SRDPEQTTEL YGTGSMATIE 
FPWPPAERSA YLELVRRAAK GDRTALPATD HVWAVIEEIS TADGVLIAGG GNMNSLYGWL
LYERAAVGII ARELGKPLVV SGQTFGPTLL PQDRKILHEL LDSAALVGAR EPVSYALGFE
LGLGSDKLVR VLDDGSFLRS QTDPAPAGAD GGLPELPADG YIAATVGPDA WREGTRTLAP
FAAVLDRAAE VTGLPVYLLP HMGTLGSSDS GGDHDSHRTV LAHSRSGKLK MLPVLPVRTA
VAVTAGARLV VTNRYHPAVF GLAAGVPVVS LANDAYSDIR LSGALGNWGL GDWALPQPSL
SPGGVEEAVA EAWRRRDEIG QHLARLRPGF ERSQATWWDA VAEVLRGVGT DDEPGTRYRG
LDEAPPLSAA ETWSRQATEQ RALFRTLSLE IGRQWTAWDD VRSQRDVLIH ERDEALREKD
RILQSRTFKA ARIFGRGADF ARQLTGRKH