Gene Arth_4216 details

Gene Information

Locus tag: Arth_4216
Symbol:
ID: 4443582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008539
Strand:
Start bp: 48948
End bp: 50669
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 68%
IMG OID: 639687741
Product: hypothetical protein
Protein accession: YP_829438
Protein GI: 116662385
COG category:
COG ID:
TIGRFAM ID:
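
The coordinates and lengths listed above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon contributing no residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 48948
end_bp = 50669

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1722 573
```

Both results match the Gene Length and Protein Length fields.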


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGCCCCC GCTGCCAAAT CAACCGTGTC GCCTGGACCT TCCCTCGGGT GGACTATTGC 
TATGTCTGCC TTCCCGGCGG GCCGTTCATT GCACCGCCGT GCTCCAAGTG CGGAACCGAC
ATGGGCTACT TCAGCCAGGG CATGTGTGCC GGCTGCCATC CCGGCAGCCC ACAATACCCA
GGCTCGTGCA AGGACTGTCT GGCGTGGGGT GTCTACCGCC GCTACAAGTG GACGTGCTGG
CCGTGTCGAT GGTGGCGTTC CCACTACCCG GAAGGAGTCT GCGACTTCTG CGGGCGCGCC
GCCCGTGTCG GAGAGCGGCG GGCCTGCCGG CTGTGTCTGG AGCAGGCCCG AATGGTCCAG
GAGCCCGGAC ACGGCCTGGA CCTGGCCGGC GCGAATAGGG ATGGCCACCA GCTCTTCTTC
GCCAACATGA GCTTCCACCG CCGCGGAGCA CCACACTTGA GCCCGGACCA GCGCGCCCCA
TGGAAACGGC GGGACAAGAA CAACCCGCCC GGCCCCGGAT CTGCTGCCGA CCACGGCGGG
CAGATGACGT TGTTCGATAT GGCGCCCGAT CCCGCCGTCG TCGCAGCGCG TCTCCGCCTC
GAAGAAAGGG ACCTGACCCG CTACTGCGCC GCCATCGTCC GTGAACACGC CGAGAGGGCC
GGCTGGAGCA AGCGGCAACG CAACGACGTA ACCCGCTCGC TGCGGCTGCT GCAGGGTTTC
CGGCTCAGCC CGACCGCGAA GATCCGCGCC ACCGACGTCC TCCAGCTGCG CCAATACTCC
GGCAACGTCC TCTCCACGAT CGACGTGCTG GCCGCGGCCG GCCTGCTCAT CGAGGACCGG
CCGACCCGGC TCGAACGCTA CTTCGCTGCC AAGACCAGCA CCCTGCCACC GGTCATGAAG
GACCAGCTCG AGGTGTGGCT GCAGGTCCTG ACCAACGGCG CCCACCAGGC GCCACGGCAG
ATCCCGCGCG ACCCACAGAC CATCAGGGCG CACATCATGG GCATCGAACC CATCATCCAC
GCCTGGGCCG GAGCAGGCCT CCAATCCTTC GCCGAGGTCA CCCGCACCGA CATCACAGCC
GCGCTGGACG AAACGACGGC CCGCCGCCAC ATCGCCGGGA ACGGACTCAA GTCGCTCTTC
ACGACCCTCA AAGGCCGCCG GCTCATCTTC GCCAATCCGA CCCGCGGGAT GAAGGCGTCC
CCGAAGGCCA GCACTATCCC TCTCGCCCTG GACGCCGCAG CGATCCGCGA GGAGCTGAAC
TCCCCGAAGC CGGTCGTCGC CCTCGCGGTT GCCTTGGTCG CCTTCCACGC ACTGACCAAG
AAACAGCTCA GCGAGCTGCG CCTCACCGAC ATCAGCGACG GACACCTGGT ACTCGGCACC
CGCGACATTC CGCTGGCCGC GCCCGTGCGC ACCCGCCTGG CAGCCTGGCT CGACCAGCGC
AACCGAACCT GGCCAGGCAG TGCCAACCCG CACCTGCTCA TCAACCGGCG TACGGCGCCC
CGGCTCCTGC CCGTCAGCCG GCAGTACCCC TGGACCGGAT TGACGCTGCG GCCCCAGGCG
CTGCGCGAGG ACCGAATCCT CCACGAGATC CACGCCACCG GCGGCGACAT CCGCCGCATC
TGCGACCTGT TCGGGCTCAG CGTCGAAGGC GCCACCCGCT ACCTCAACAC CGTCGAGCAC
CCCGACCTCA CCCTTGAAGG CGAACAGGTT CCCCGAACCT GA
 
Protein sequence
MCPRCQINRV AWTFPRVDYC YVCLPGGPFI APPCSKCGTD MGYFSQGMCA GCHPGSPQYP 
GSCKDCLAWG VYRRYKWTCW PCRWWRSHYP EGVCDFCGRA ARVGERRACR LCLEQARMVQ
EPGHGLDLAG ANRDGHQLFF ANMSFHRRGA PHLSPDQRAP WKRRDKNNPP GPGSAADHGG
QMTLFDMAPD PAVVAARLRL EERDLTRYCA AIVREHAERA GWSKRQRNDV TRSLRLLQGF
RLSPTAKIRA TDVLQLRQYS GNVLSTIDVL AAAGLLIEDR PTRLERYFAA KTSTLPPVMK
DQLEVWLQVL TNGAHQAPRQ IPRDPQTIRA HIMGIEPIIH AWAGAGLQSF AEVTRTDITA
ALDETTARRH IAGNGLKSLF TTLKGRRLIF ANPTRGMKAS PKASTIPLAL DAAAIREELN
SPKPVVALAV ALVAFHALTK KQLSELRLTD ISDGHLVLGT RDIPLAAPVR TRLAAWLDQR
NRTWPGSANP HLLINRRTAP RLLPVSRQYP WTGLTLRPQA LREDRILHEI HATGGDIRRI
CDLFGLSVEG ATRYLNTVEH PDLTLEGEQV PRT
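
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they can be compared. The sketch below is one way to do that and to translate the cleaned DNA using only the Python standard library. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ only in which start codons are permitted). Alternative start codons are not handled, which does not matter here because this gene begins with ATG. The names `clean_sequence` and `translate` are illustrative, not part of any database tool.

```python
# Codon table built in the usual T, C, A, G ordering.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGTGCCCCC GCTGCCAAAT CAACCGTGTC")
print(translate(first_blocks))  # MCPRCQINRV
```

The output matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and `translate` allows the same comparison against the whole protein.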