Gene Arth_3016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3016 
Symbol 
ID4444383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3384726 
End bp3386378 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content69% 
IMG OID639690840 
Producthypothetical protein 
Protein accessionYP_832495 
Protein GI116671562 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGTCC AATTCGAACT GCCTACGACT CTTCTGCTGG GACCGCTGGC CGCCGCCGTC 
GTTATCTACC TCATCCTGAG GTGGGTGGTA TGGGAACCCC GGATGGGCGC CTCGCCCGAA
ACTGTTTCGC AGCACGCGCT CTGGGTGGGC GTCATCGGAT GGATGACTAG CTCCCTCCAG
GGGGCCATGA ATATCGGCAT CATTCCGTCG GGCAGAACCA CCAGCCCCGC GATCGGCCCT
TTCCTGGTCA CCCCGGACAC CATTATCCCG GCCCTGGCAT GGCCCATCCT CGGAACTATC
GGAGTCCACG CACTGGGCCA GCTCAGCTAT CCGCGCCCCC GGGGACCCCG CCGCAAGGCC
AGCCTCCAGG TCCGGAAAAT CCGCGACTTC CTTCCGCGTC CGCTGGCCTG GACAACATTG
GCCATTTTCA CGGGCGCAGC CGGATTCACC GCATGGACCG CTACGCTCCC GGGCTTCGCA
GCAATTGCGT ACGGCTCGGT CCGGGAGGAC CCGCAGGGCT ACCGGACCAT CGGCGGCGAC
GGGCGCATCC CCGGTTCTGA ACTGGCCGGC TGGCTGGGTG CCGCGCTGGT TGCCCTGGCT
GCGTGCACCT GGCTGGTGCT GCTGCTGATC TGCCGCAGAC GGCAGCTCGA GCAGCTGACG
GACCACGACA ATTCGATCCT GCGAACCATC GCCATGAACA GATTGCTCCG CACGGTCTCC
ACGATGGCGT CGGGCCTGGC CGTCATCGCC GGTAACTATG TGGCCCGCCC GGACCCGGCT
GCGGGCAGCA CGTCGTGGAC TAATTTCGCG GCCTTTGCGG GCATGGCGGT CCTGCTTGCC
ATGCTGCTCT GGGCGCCGCC GAAACTAACG GGAACCGCAA CGGCCGCCGG CCTCCGGAAC
GCGAAGCCCG GGCACTTCGG CGCCGGACCG CACCCGGCAT CCAAACTTGT GGTCTCGGTC
GGCGCCGGAC TCGGCGTCGC CGCAGCCCTG CCGGTGCTTG CGGGAGCGTT CCTGGTCCCG
GCCATCATCG CAGCGGGGCC GTCCGGGCCG GCGGTTTTCA TAACACTAGT TGCGGGGCTG
GTCCTTGTGG TCCTCGCCGC CGGCGAGCTG TTGCTGCAGC GCAACTACAC GAGCCCGGAT
GAACCGCGGA CATGGCCCCG GCAGCCGGTG GGCGCGGCCC TTCTCACCAC ACTGATCCTG
GCGCTGCTCG TCCTCGTCAC AACCCTCGTG GTCTCAGCCG CCGGGAACAC CCTGCTTGGC
CGGGACGGAG GGTGGATCCC AGCTGCGCTC CTCAGTACCG CCGGGGTGCT GCTGGCGCTC
CCGGCTTTGC TGGCAACGCG CCTGCGGCGA GGCATCCCCG CTGCTCCGCC CGGGCTCGAC
GCAGCGCTGC GTGCCATCAC CCTCCACCGG ATGGCGCGAA CGCTCTCGGC CATGTTCATG
GCGCAGGCAG CGATGGTCCT GCTGATGAAC AGCCAGGCGT GGGCCGCTGT CTTCGGCGTG
GCATTCCTGC CGTCCATCCC GTGGTGGCCC GCTTCCCTCG CGGGCACGAT CCTGGCCTGC
GCGGCGGTCG CCACAGCCGT GATACCGGTG CGGACCTTCG CAGGCTCCCG TTCCCGGCCC
GCTCCCCTGC CCCGCAGGGA CCACGTCACA TGA
 
Protein sequence
MPVQFELPTT LLLGPLAAAV VIYLILRWVV WEPRMGASPE TVSQHALWVG VIGWMTSSLQ 
GAMNIGIIPS GRTTSPAIGP FLVTPDTIIP ALAWPILGTI GVHALGQLSY PRPRGPRRKA
SLQVRKIRDF LPRPLAWTTL AIFTGAAGFT AWTATLPGFA AIAYGSVRED PQGYRTIGGD
GRIPGSELAG WLGAALVALA ACTWLVLLLI CRRRQLEQLT DHDNSILRTI AMNRLLRTVS
TMASGLAVIA GNYVARPDPA AGSTSWTNFA AFAGMAVLLA MLLWAPPKLT GTATAAGLRN
AKPGHFGAGP HPASKLVVSV GAGLGVAAAL PVLAGAFLVP AIIAAGPSGP AVFITLVAGL
VLVVLAAGEL LLQRNYTSPD EPRTWPRQPV GAALLTTLIL ALLVLVTTLV VSAAGNTLLG
RDGGWIPAAL LSTAGVLLAL PALLATRLRR GIPAAPPGLD AALRAITLHR MARTLSAMFM
AQAAMVLLMN SQAWAAVFGV AFLPSIPWWP ASLAGTILAC AAVATAVIPV RTFAGSRSRP
APLPRRDHVT