Gene Arth_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0201 
Symbol 
ID4447359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp210469 
End bp211596 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content68% 
IMG OID639687996 
Producthypothetical protein 
Protein accessionYP_829702 
Protein GI116668769 
COG category[S] Function unknown 
COG ID[COG4129] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCATCC CAGCAGGACT TTCAGCAAGC GGGCGTTTCC TGCGCGGCCG CATCCGCACC 
GGCCTGGTCC GGAGCCGCAA CTCGCTGATC CCCGCCGTCC AGATGACCGG GTGCGCCGTG
GGGGCTTATG CCTTCGCCGA GTACGTACTG GGGCATTCGG GTCCCCTTTT CGCGGCGACG
TCGTCGCTGA TCGCGCTGGG CTTCTCGCGG GAGCCCCGGC TCCGCCGGGT GGTTGAGGTG
GGCCTGGGCT GCACCATCGG CATTGCCGTG GGCGACCTGC TCCTGCACTG GCTGGGTGGC
GACATCTGGG TGGCCGCCGT CGTGCTCCTC ACCTCCATCC TGCTGGCGCG TTTCCTGGAC
AGCGGCAACA TCTTCACCAC GCAACTGGGG CTTCAGTCGC TCCTGGTGGT GCTGCTGCCG
GCGCCCGCCG GGGGGCCGTT CACCCGCAGC ATCGATGCGA TAGTGGGTGG ACTGTTCGCG
CTGCTGGTCA CCATCCTCAT CCCCAAGGAT CCCCGTCGGG AACCGCGCAA GGACGTCCGG
AAACTGCTGC ACGAGCTCGC CGAGGTGCTG CGCGAATGCG CCCAGGCCCT ACTTGAAAGC
GACTCCACCC AGGCGTGGCA TGCCCTGATC CGCGGCCGAA ACTGCCAGCC CCTGGTGGAC
GCAATGCGCC AGACCCTGCG CGCTTCCGGC GAGGTTGCCA CGCTCGCACC TGCGTACCGC
CGGCACCGGG ATGAACTGGA CCGGCTCGAG CAGTCGCTGG ACTTCATCGA CCTTGCGCTC
CGTAACAGCC GCGTTTTTGC GCGCCGGCTG ACGAGCGCCA TCAACCATGC GGCGTTGTCC
GACGAAGCCA CGGAAAACAT CGCCGAGGTG CTGCAGGAAA CCGCTGCCGC CATCGACGAG
CTCTCCCTGG GCCTTGCCGA AACGCACGAG GGCGCGCGGC GGGCCCATCT CCGGACGGCC
CGGCAGGACC TGAGCGGGAT CGCGCTGCGA CTTCACCCGA AGCTGCTGGA TGTCCAGCGC
CTCGAGGGCG AAACGGTGGT CATGCTGTTT CGTCCGCTGA TGGTGGACCT TCTCGAGGCC
AGCGGAATGG ACGCCCTGGA GGCCCGGGAC ATCCTCCCGC CGCTGTAG
 
Protein sequence
MAIPAGLSAS GRFLRGRIRT GLVRSRNSLI PAVQMTGCAV GAYAFAEYVL GHSGPLFAAT 
SSLIALGFSR EPRLRRVVEV GLGCTIGIAV GDLLLHWLGG DIWVAAVVLL TSILLARFLD
SGNIFTTQLG LQSLLVVLLP APAGGPFTRS IDAIVGGLFA LLVTILIPKD PRREPRKDVR
KLLHELAEVL RECAQALLES DSTQAWHALI RGRNCQPLVD AMRQTLRASG EVATLAPAYR
RHRDELDRLE QSLDFIDLAL RNSRVFARRL TSAINHAALS DEATENIAEV LQETAAAIDE
LSLGLAETHE GARRAHLRTA RQDLSGIALR LHPKLLDVQR LEGETVVMLF RPLMVDLLEA
SGMDALEARD ILPPL