Gene Arth_4278 details

Gene Information

Locus tag: Arth_4278
Symbol:
ID: 4443529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008538
Strand:
Start bp: 11700
End bp: 13211
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 66%
IMG OID: 639687599
Product: hypothetical protein
Protein accession: YP_829296
Protein GI: 116662242
COG category:
COG ID:
TIGRFAM ID:
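
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
gene_len = gene_length_bp(11700, 13211)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)
```

With the values in this record, the computed lengths agree with the listed Gene Length (1512 bp) and Protein Length (503 aa).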


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGCTA TCGAGAACAC ATACAGGGAA CCGACCTACG GCAACTGGCG GCGTCCGCGC 
AAAGCGGGCA TCGGCACCCT TGGTGGGCTG GCAACAGCCG GTTTGTTTGT TGGCCTCATC
ATCACGGTCA TCTGTCTCTT TGTCGGCGGC TGGTTCGCCG GGCTCATTGC CCTGCTCGTC
CTTGGGGTGG CGCTGGCCGT CGTCTCGGTG ACCGACAAGC ACAACAAGTC CGTAGGGGAG
CGGATCTTCG CCCGGCTGGC GTTCCGCTCA GCACGGCGCA GACGCGCCAA CATCTACCGC
TCCGGTCCGT TGGGACTGAC GCCTTGGGGC GAGTTCCAGC TACCTGGAAT CTCTGCCGGT
TCCAAGCTTT ACGAGTTCAC CGACTCCTAT GGCCGCCCCT TCGCGCTGAT CCACCTTCCC
TCCACCGCGC ACTACACCGT TGTCTTCGGC ACCCAGCCCG ACGGTGCGTC CTTGGTCGAT
CCTGAGCAGA TTGATGCCTG GGTGGCTAAC TGGGGCGGCT GGCTGGCCTC GTTGTCGAAT
GAACCTGCCG TTGTGGCTGC ATCCGTCACG GTCGAAACCG CACCCGATTC CGGGGCACGG
CTGCGCCGGG AAGTTGAATC GAACATCCAT GACGACGCCC CTGACATTGC CAAGGCCATG
CTCTGGGAGG CCTTGAACAC CTACCCGCAG GGATCGGCCA CGATCCGCGC CTGGGTCGCG
CTGACGTTCA GCGCCGCTGC CCGGGCAGGC GGCAAGCGAC GCACCGCAGA AGAAATCGCA
AGGGACCTCG CTTCACGGCT TCCCAGCCTG ACCGAGCGTC TGGAATCCAC CGGCGCCGGG
GCATGCGCAC CATTGAACGC CCAGGAACTG TGTGAGGTCG TTCGGGTCGC ATACGACCCT
GCAGCGGCCC GGCTCATCGA CGAAGCCCAC TACCAGGGCA CACCCGTTGA CCTGGACTGG
GGCGACGTCG GACCATCCGC CCACCAATCC AACTGGGACG GCTACCGCCA CGATTCCGGG
CACTCGGTGT CCTGGACGAT GACAGGCGCA CCACGCGGAC ACATCCTGGC ATCAGCGCTG
GGCCGCCTCG TCGCACCGCA CGCGGAGATC GACCGCAAGC GCGTCACCTT GCTGTACCGG
CCCATCGAAG CCGGCCGCGC GGCCGCCATC GTGGAGTCGG ACCAGACCAG CGCCCTGACC
CGGGCCTCCT CGACCAACCG GCCCACCGCC CGGGCCCTGG TTGATGCCCG TGCGGCGCAG
GCCACCGCAG CGGAGGAAGC CAAGGGGGCC GGGCTGGTCA ACTTCGGCAT GGTGGTTACG
GCCACGGTGC TGTCCTCCCG TGACCTTGAG GACGCCGTGG CCGTCGTGGA GGGCAACCTC
GGCCCGTCGG CACGTCTGCT GTTGCGGCGC GCTTACGGCT CCCAGGATTC TGCCTTTGCC
GCGTCCCTGC CACTGGGTCT GGTCCTGCCC AAGCACTTGA AGGTTCCCGA AGAGATCCGT
GAGGCCATGT GA
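
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below shows one way to do this and to compute a GC fraction and check the final codon. It assumes the stop codons of translation table 11 are TAA, TAG and TGA; the helper names are hypothetical, and only the last line of the sequence is used as sample input.

```python
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}  # assumed stop set for table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form an assumed table 11 stop codon."""
    return seq[-3:] in STOP_CODONS_TABLE_11


# Sample input: the final line of the gene sequence above.
tail = clean_sequence("GAGGCCATGT GA")
print(len(tail), ends_with_stop(tail))
```

Applying `gc_fraction` to the full cleaned sequence would give a value to compare against the GC content field in the record.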
 
Protein sequence
MAAIENTYRE PTYGNWRRPR KAGIGTLGGL ATAGLFVGLI ITVICLFVGG WFAGLIALLV 
LGVALAVVSV TDKHNKSVGE RIFARLAFRS ARRRRANIYR SGPLGLTPWG EFQLPGISAG
SKLYEFTDSY GRPFALIHLP STAHYTVVFG TQPDGASLVD PEQIDAWVAN WGGWLASLSN
EPAVVAASVT VETAPDSGAR LRREVESNIH DDAPDIAKAM LWEALNTYPQ GSATIRAWVA
LTFSAAARAG GKRRTAEEIA RDLASRLPSL TERLESTGAG ACAPLNAQEL CEVVRVAYDP
AAARLIDEAH YQGTPVDLDW GDVGPSAHQS NWDGYRHDSG HSVSWTMTGA PRGHILASAL
GRLVAPHAEI DRKRVTLLYR PIEAGRAAAI VESDQTSALT RASSTNRPTA RALVDARAAQ
ATAAEEAKGA GLVNFGMVVT ATVLSSRDLE DAVAVVEGNL GPSARLLLRR AYGSQDSAFA
ASLPLGLVLP KHLKVPEEIR EAM