Gene Arth_4224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4224 
Symbol 
ID4443590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008539 
Strand
Start bp57793 
End bp59025 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content66% 
IMG OID639687749 
Producthypothetical protein 
Protein accessionYP_829446 
Protein GI116662393 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAT CAGACGGAAT AGATGAAGTT CTGGACGGCG GAATGCGGCA GTCCCTGATC 
ATCGCGTCCC GCATTGCCGA GACGCTGGCC CGTCGCCGGC AGGAGTCCCA GCGGCAGCAG
GAACATCAGG ATGCCCAGGC AGCGCACGAA GCACAGGCAC GCCTCACGGC AGACCGCAGC
GCGGCGCACG CCGCGCTGGC ACCGGTCAAT AGGGACCAGT GGTGGGACAA AGCCCAGCCC
CACGACATTG CCACGGCACA TGCCGTCGCT GAAGGCTGGA AGGACCACGA CCCGACCGCC
CTGGCGGCCT CGGAAAAGAT CCGTCAGGAA GTCTTCACGC GCTACGGCAT CGACACCCGC
GACATCGGCG CGGGCGACGC CTACCTGGAG TCAGGGATCC GGACCGCCGC TACGGAAAAA
GCCCGGCAGA GCGCACTGGA ACGCAGCCAG GAAGAAACAC GCACGGCGGC CGTCGAACAT
GAGAAGGCCA TGGGCCTGCT CGCTGCTGCC CGTGTCGAAG AACTCCGCGC CCGGGCCGCG
ACACTGGCCC CGGAAATGGA ACGCCACCAA GTGCCCATGG AGTACCTTGC CAACCCCGAG
CTCGCCCGGG CATTGCAGAC GGCACACGCC GCGAAGACCC CCGCAGCAGT GGCAGCCGCC
GACGCCACCG TGCAGGAACG CATGTTCCTC ATCGGCAAGG ACGGCATCAA CGGCCCCGAC
ATCGACCAGC TCCGTGCGGA GACCACTGCG AACGTCAACG GTGCCAAGGA TTCACACTTC
GAGGATCCCG CATTCGTCCA AGCAGCCAAG GACATGCACG AGGCCAAACT CCTGGCAGAG
GGCGGTTTCA CAGGCACGGA GCGGACGCCC GTGGAGCAGC GGTACGAACG AGCCGAAAAG
GAACTCTTTG CCCGCATGGA AAGCGTCGGC CGCGAAATCG AAAACCGCGT CACCGGCAAC
GACAACAGCC GGCTAAAAGA CCAAGGGTTG AAGGCTGAAA GCACTTCTGC CGCGGACTAT
GGATCAGCGG AGCGTCAGGA AGCATTCGCC GCGTCCCTGG CCACTACCGG CGCCAACGAA
GTGCAAGTCC GGGGACGCGC CGCCGCAGAA CGCAGCGAGG GCACACACCC CCGCGCGGCC
GTCACCATGG GCAAGGGCGC AGCCAAGGCC AAAAAGACCC GCACCAGCCT CTCGGCTAGC
GCGAAGCGGT CCCAGAGCGG TCGCAGCCGA TGA
 
Protein sequence
MSESDGIDEV LDGGMRQSLI IASRIAETLA RRRQESQRQQ EHQDAQAAHE AQARLTADRS 
AAHAALAPVN RDQWWDKAQP HDIATAHAVA EGWKDHDPTA LAASEKIRQE VFTRYGIDTR
DIGAGDAYLE SGIRTAATEK ARQSALERSQ EETRTAAVEH EKAMGLLAAA RVEELRARAA
TLAPEMERHQ VPMEYLANPE LARALQTAHA AKTPAAVAAA DATVQERMFL IGKDGINGPD
IDQLRAETTA NVNGAKDSHF EDPAFVQAAK DMHEAKLLAE GGFTGTERTP VEQRYERAEK
ELFARMESVG REIENRVTGN DNSRLKDQGL KAESTSAADY GSAERQEAFA ASLATTGANE
VQVRGRAAAE RSEGTHPRAA VTMGKGAAKA KKTRTSLSAS AKRSQSGRSR