Gene Arth_1393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1393 
Symbol 
ID4446099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1552037 
End bp1553656 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content67% 
IMG OID639689204 
Producthypothetical protein 
Protein accessionYP_830887 
Protein GI116669954 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0764737 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGTCT TCTCCTCCCT CATGGACGGG TTCGCCACGG CGCTAACCCC CATGAATTTC 
CTCTATGCAG TCATCGGTGT GGTCCTCGGC ACGGCCGTGG GAGTCCTCCC GGGCCTCGGC
CCGGCAATGA CCGTGGCCCT GCTGCTTCCG GTCACCTACG CCCTGGAGCC GACCAGCGCC
TTCATCATGT TCGCCGGCAT CTACTACGGC GGCATGTATG GCGGCTCCAC CACCTCCATC
CTGCTCAACA CACCCGGTGA GTCGTCATCG GTGGTCACCG CCATCGAAGG CAACAAAATG
GCGAAAGCCG GGCGGGCGGC ACAGGCTTTG GCCACCGCCG CCATCGGTTC CTTTGTTGCC
GGCACCATCG GTACCACGCT GCTCGCCGTC TGCGCGCCGA TCGTGGTCCA GTTCGCCGTC
AGCCTGGGCT CCCCCAGCTA CTTCGCGATC ATGGTGCTCG CCCTGCTGGC CGTCACCGCC
GTGCTGGGTT CCTCCCGGCT GCGCGGGTTC GCGTCGCTGG GCCTGGGCCT GGCTATTGGC
CTCGTGGGCA TGGATTCAGT CACTGGCCAG CAGCGCCTCA CGTTCGGGAT GCCGCTCCTG
GCCGACGGCC TGGACATCGT GGTGGTGGCT GTGGCCATCT TCGCCGTCGG CGAGGCACTG
TGGGTGGCTG CGCACCTGCG ACGCACTCCG ATGAACATCA TCCCTGTAGG ACAGCCCTGG
ATGGGCAAAC AGGACTGGAA GCGGTCCTGG AAGCCCTGGC TCCGCGGTAC GGCTTTTGGG
TTCCCCTTCG GAGCACTTCC CGCGGGCGGC GCCGAGATCC CCACTTTCCT GTCCTACGTG
ACGGAGAAGC GGCTCTCCAA GCATCCCGAG GAATTCGGCC ACGGCGCCAT CGAGGGTGTT
GCCGGGCCGG AAGCCGCCAA CAACGCCGCG GCGGCAGGCA CGCTGACCCC CATGCTTGCC
CTTGGCCTGC CCACCAACGC CACGGCCGCC GTCATGCTGG CAGCTTTCAC GTCCTACGGC
ATCCAGCCCG GGCCGCAGCT GTTCGCCAGC GAGGGGCCGC TGGTCTGGGC GCTGATTGCC
AGCCTCTTCA TCGGCAACTT CCTGCTCCTG ATCATCAACC TTCCGCTGGC ACCGGTCTGG
GCAAAGCTCC TGCAGCTCCC TAGGCCGTAC CTCTACGCCG GGATCCTGTT CTTCGCTACG
CTGGGCGCCT ATTCGGTGAA CCTGCAGGCA TTCGACCTGG TCCTGTTGCT GGTGCTTGGT
GCGTTGGGCT TCATGATGAG GCGCTTCGGG CTCCCCGTCC TGCCGCTGGT CCTGGGTGTG
ATCCTGGGGC CGCGCCTGGA AGGCCAGCTG CGCAAGACCC TCCAGCTCAG CGCCGGCAAT
CCGGCCGGGC TGTGGAGCGA ACCGATCGCC GTCGGGATCT GGGTCATCGT GGCGATCATC
CTTCTCTGGC CGCTGCTGTT CATGCTGATC CGCCGCAACC GCCCGATGCG CAGCCCACTG
CTTCCCGCCA CTTCCGGCGC AGCCGAACCG CGGGAAGTTA GCGGCAGCGT CAGCCACGCC
AGCCGCCGTG CGGACAGTTC GTCCGGCGAC GGCCACAGCG ACGGCGACGG CGACGGTTAG
 
Protein sequence
MDVFSSLMDG FATALTPMNF LYAVIGVVLG TAVGVLPGLG PAMTVALLLP VTYALEPTSA 
FIMFAGIYYG GMYGGSTTSI LLNTPGESSS VVTAIEGNKM AKAGRAAQAL ATAAIGSFVA
GTIGTTLLAV CAPIVVQFAV SLGSPSYFAI MVLALLAVTA VLGSSRLRGF ASLGLGLAIG
LVGMDSVTGQ QRLTFGMPLL ADGLDIVVVA VAIFAVGEAL WVAAHLRRTP MNIIPVGQPW
MGKQDWKRSW KPWLRGTAFG FPFGALPAGG AEIPTFLSYV TEKRLSKHPE EFGHGAIEGV
AGPEAANNAA AAGTLTPMLA LGLPTNATAA VMLAAFTSYG IQPGPQLFAS EGPLVWALIA
SLFIGNFLLL IINLPLAPVW AKLLQLPRPY LYAGILFFAT LGAYSVNLQA FDLVLLLVLG
ALGFMMRRFG LPVLPLVLGV ILGPRLEGQL RKTLQLSAGN PAGLWSEPIA VGIWVIVAII
LLWPLLFMLI RRNRPMRSPL LPATSGAAEP REVSGSVSHA SRRADSSSGD GHSDGDGDG