Gene Arth_2749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2749 
Symbol 
ID4444598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3091817 
End bp3094837 
Gene Length3021 bp 
Protein Length1006 aa 
Translation table11 
GC content64% 
IMG OID639690571 
Producthypothetical protein 
Protein accessionYP_832228 
Protein GI116671295 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.179531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTCCCGTC CCGCCAGCTC CACTCCGCCC GGAAGACCCC AGCCAAGGCG AGGTGCCTTG 
ACGCCGACGT TGATCGTCGT AGCACTGGTT GTGGTCGGAT TCATCTTCTT CGCCAATGTC
TGGACCGATG TCCTCTGGTA CCAGCAGCTC GGGTTCTTTG AAGTATTCCT CACGGAGAAC
CTGGCCCGGA TCATCATCTT CCTTGCCGGC TTCGCGCTGA TGTTCGTGGC CATGTTCTAT
GCCATTCGGA TCGCGTACCA CGCCCGTCCC GTCTACGCGC CGGACTCGGA GATCAGGGAC
AACCTGAACC GCTACCAGGC TCAACTGGAA CCCGTCCGCC GGGTGGTCAT GATCGGTCTG
CCGGTGCTGT TCGGCCTCTT TGCCGGAAGC GCGGCCGCCA GCCAGTGGCA GAAGGTGCTG
CTGTTCCTGA ACCAGGAGCC GTTCGGCCAG AACGATCCGC AGTTCAACCT GGACATCAGC
TTCTACCTGA TGACCCTGCC GTTCCTCGGC TTCGTGACCG GCTTCCTCAT CAGCGTCGTT
GTGGTCGCTG GTATCGCGGG AATCCTGACG CACTATCTCT ACGGCAGCAT CCGGATCATG
GAACGCGGCA TCTTCACCAG CCGTGCCGCG CAAATCCACC TCGCCGTCAC CGGTGCGGTC
TTCCTGCTTC TGCTTGGCGT GAACTTCTGG CTGGACCGCT ATTCCTCAGT TCAGAACAGC
AACGGACGCT GGGCCGGCGC CCTTTACACG GACGTCAACG CCGTCATCCC CACCAAATCG
ATCCTGGCTG TAGCCGCCGC GCTGGTGGCA ATCCTGTTCA TCGTCGCCGC AGTGATCGGC
AAATGGCGAC TGCCCGTCAT CGGCACGGCA ATGCTGGTCA TCACCTCCAT CCTCGCCGGC
GGTGTCTACC CGTGGGTCAT CCAGCAGTTC CAGGTGCGCC CGTCGGAACA GACCCTCGAG
AGGCAGTTCA TCGAGCGGAA CATCAGCATG ACCCGCGCCG CCTACGGCCT GGATAAGATC
CAGGAGAAGC GGTACAACGC CACCACTAAC GCCACCACAG GAGCACTGGC ACCGGACGCG
CAGACCACTG CCAATATCCG CCTCCTGGAC CCGAACCTGA TTTCGGACGC CTTCTCCCAG
CTTGAGCAGT ACCGTCCCTA CTACCAGTTC CCGAGCGCGC TCAATGTGGA CCGGTATGAA
GTTGACGGCA AGGTGCAGGA CACTGTGATT GCTGTCCGCG AGCTGAACCC GGACGGCCTC
AGCGCCAACC AGCAGTCCTG GCTGAACCGG CACGTGGTCT ACACCCACGG TTACGGCGTA
GTGGCCGCTA AGGGCAACAA GTTCACCGCC GACGGCAAGC CTGAGTTCCT GCAGGCCGGC
ATTCCATCCA CCGGCGTGCT CGGCAACGAT TCGACGTACC AGCCCCGGAT CTACTTCGGC
GAAAACTCGC CCGAGTACTC GATCGTAGGG GCACCCGAGG GTTCGCCGCA CCGTGAGCAG
GACCGTCCCG CCGGCAAGGA AGGCGATGGC GAAACCCAGT ACACCTTCAC CGGCAACGGC
GGCCCGAACG TAGGCAGCTT CTTCAACAAG GTCCTCTACG CGATCAAGTT CCAATCGTCC
GACCTGCTGC TGTCCGACGG CGTCAACGCC GAGTCGCAGA TCCTCTACGA CCGCAACCCG
CGGGACCGCG TCGAAAAGGT GGCCCCCTAC CTCACGGTCG ACGGCAACGC CTACCCGGCG
GTGGTGGACG GCCGCGTGAA GTGGATCGTG GACGGCTACA CCACCAGCCA GTACTACCCG
TACTCGCAGC AGGAGCAACT GTCCGCAGCC ACCGCTGATT CGCAGACCAC GGCCGGGCGC
ACGGTCGCGT TGCCGAATAG CTCGGTGAAC TACATCCGCA ACTCCGTGAA GGCAACGGTT
GACGCCTACG ACGGCTCGGT GACGCTTTAC GCCTGGGACG ATCAGGACCC GGTGCTGAAG
GCATGGCAGA ACGTCTTCCC GACATCCCTG AAGCCCTATT CGGAGATGTC CGGCGCGCTC
ATGAGTCACG TCCGCTACCC CGAGGACCTG TTCAAGGTCC AGCGCGAACT GCTGGGCCGC
TACCACGTCA CGCAGCCGGA CAACTTCTAC ACGAACAACG ATGCCTGGTC CGTGCCGAAC
GATCCCACGG TCAAGGAAGA GGTCAAGCAG CCGCCGTTCT ACATGTCACT GCAGATGCCG
GACCAGGACA AGCCCGCCTT CCAGCTCACG TCGTCGTTCA TTCCGCAGGT GGTCAACGGC
ACCGCTCGCA ACGTGCTCTA CGGCTTCCTG GCCGCGGACT CCGATGCCGG CAACCAGAAG
GGCGTGAAGG CGGAAAGCTA CGGCCAGCTA CGGCTGCTGC AGATTCCTCC GGAAGCTCAG
GTCCCGGGCC CGGGCCAGGC CCAGAACAAG TTCAACTCCG ATCCCACAGT GTCCCAGGCG
TTGAACCTGC TCCGGCAAGG CGCGTCGGCC GTCCTCAACG GCAACCTGCT GACCCTCCCG
GTGGGCGGCG GTTTGCTGTA CGTGCAGCCT GTCTACCTCC GCTCCACGGG CGAAACGTCC
TACCCCACAC TGCAGCGCGT GCTGGTTGCC TTCGGTGACA AGATCGGGTT CGCGCCGACA
CTGGATGAAG CGCTGAACCA ACTCTTCGGC GGCCAGTCGG GCGCCAAGGC CGGTGACTTT
GCCAATAACG GCCAGACACC GCCGCCCGCA GCCGGAGGAA GCACTCCGCC GGCCACCGGT
GGTACGGACG CCAAGGCGGA ACTGAAAGCC GCACTGGATG AGGCGAACGC AGCCATCCGT
GCGGGCCAGG AGGCTCTGGC CAAGGGGGAC TTCGCCGCCT ACGGCGAGCA GCAGAAGAAG
CTGTCCGCCG CCCTCCAGAA GGCGATCGAT GCCGAAGCGA AGCTCGGTTC GGAAGGTGCC
TCGCCGACGC CGGGAGCCAC CACGGCTCCC ACAGCGACCC CGTCGGCCGC CGCGACGCCG
TCGCCCTCTC CGAGTAACTG A
 
Protein sequence
MSRPASSTPP GRPQPRRGAL TPTLIVVALV VVGFIFFANV WTDVLWYQQL GFFEVFLTEN 
LARIIIFLAG FALMFVAMFY AIRIAYHARP VYAPDSEIRD NLNRYQAQLE PVRRVVMIGL
PVLFGLFAGS AAASQWQKVL LFLNQEPFGQ NDPQFNLDIS FYLMTLPFLG FVTGFLISVV
VVAGIAGILT HYLYGSIRIM ERGIFTSRAA QIHLAVTGAV FLLLLGVNFW LDRYSSVQNS
NGRWAGALYT DVNAVIPTKS ILAVAAALVA ILFIVAAVIG KWRLPVIGTA MLVITSILAG
GVYPWVIQQF QVRPSEQTLE RQFIERNISM TRAAYGLDKI QEKRYNATTN ATTGALAPDA
QTTANIRLLD PNLISDAFSQ LEQYRPYYQF PSALNVDRYE VDGKVQDTVI AVRELNPDGL
SANQQSWLNR HVVYTHGYGV VAAKGNKFTA DGKPEFLQAG IPSTGVLGND STYQPRIYFG
ENSPEYSIVG APEGSPHREQ DRPAGKEGDG ETQYTFTGNG GPNVGSFFNK VLYAIKFQSS
DLLLSDGVNA ESQILYDRNP RDRVEKVAPY LTVDGNAYPA VVDGRVKWIV DGYTTSQYYP
YSQQEQLSAA TADSQTTAGR TVALPNSSVN YIRNSVKATV DAYDGSVTLY AWDDQDPVLK
AWQNVFPTSL KPYSEMSGAL MSHVRYPEDL FKVQRELLGR YHVTQPDNFY TNNDAWSVPN
DPTVKEEVKQ PPFYMSLQMP DQDKPAFQLT SSFIPQVVNG TARNVLYGFL AADSDAGNQK
GVKAESYGQL RLLQIPPEAQ VPGPGQAQNK FNSDPTVSQA LNLLRQGASA VLNGNLLTLP
VGGGLLYVQP VYLRSTGETS YPTLQRVLVA FGDKIGFAPT LDEALNQLFG GQSGAKAGDF
ANNGQTPPPA AGGSTPPATG GTDAKAELKA ALDEANAAIR AGQEALAKGD FAAYGEQQKK
LSAALQKAID AEAKLGSEGA SPTPGATTAP TATPSAAATP SPSPSN