Gene Arth_1733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1733 
Symbol 
ID4445737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1938798 
End bp1940153 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content61% 
IMG OID639689553 
Producthypothetical protein 
Protein accessionYP_831225 
Protein GI116670292 
COG category[S] Function unknown 
COG ID[COG4325] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTGGC GTGGTGCTTA CTGGATGGAC GCTTTACGGA CTCAGCTGTG GCCTGTTCCG 
ACGTTGGCGG TTGCCCTGGC TGTGATCCTG GGGACGGTGT TGCCGGCGCT GGACGCTGCA
GTTGACGGCC AGCTGCCCGA AAGCGTCGCG GTTTTTCTCT TCAGTGGAGG ACCGGAAGCG
GCCAGGTCCG TCCTGCAGGC GATCTCTGGC TCCCTCATCA CTGTGACGTC CCTGACTTTC
TCACTCACCG TTGTCACGCT GCAGCTGGCC AGCAGCCAGT TCTCCCCGCG GCTGCTGCGG
ACCTTTAGCT CCGACCGGTT TGTCCACGGC ACGCTGGCCC TCTTTCTGGC TGCCTTCGCC
TTTGCGTTGA CAGTCCTTCG CAGCGTTCGC GGCGAAAGTG ACGGAACCCG TCTGTTTGTA
CCTGAAATAT CCGTCACCGT CGCATTCGTG CTCGCGATAG CAAGTGTCAT CGGACTGGTG
TTGTTCCTGG CACACCTCAC CCGGGAGATC CGTGTGGAAA CCATGATGCG CAGGGTTAAC
GTGGAAACCC AGGAGACCAT CGACAGGGTA TTTCCTGGGG CACGCCCCGT GCCGGGACCG
GGGCCAGATC CTGTGTCCGA AACATTCCTC ATCAACTCCA CCAGCTCGGG GTTCCTGAAC
ACCTTTGACA AGGACGGCCT GATGCAGGCC GCGGAAAATT CCGGCGCCCT GATTCGGATA
GACCGGGAAC CTGGCAGCTC CCTCGTGGAG GGTGTTCCGT TCGCAACCGC CTGGCCCGCC
GAACCGGGAA CAGCACTCAC TCCGGCAGTC ATCGAAAAGC TGACTGACGA CGTCAACGCT
GCTGTGTCCA CCGGGTTTGA ACGCACGAAT GTCCAGGACG TAGGCTTCGG ATTCCGCCAG
CTGGTCGATG TGGCCGTGCG TGCGCTGTCC CCGGGCATCA ACGACCCGAC CACTGCGGTC
CACGTCATTG GGCACCTTTC CGTCCTCCTG TGCCGGCTGG CGGAAAGAAA CCCCGGGCCC
GACCACTTCA CCGACAAAGA CGGGCGGGTG AGAGTGGTGG TTTCGCTTCC AAAGCTCAAG
GACTTACTCG ATATGGCGAT GAACCAGCCC AGACAATACG GAGCGTCAGA CCCTGTCGTC
GCAGGACGCC TCCTGGCGCT GCTCCAAGAG CTCACCTGGT GCGACCGTAA GAACCAGTAC
CGGTCGGAGA TACTGGACCA CCTGGACCGC ATGCGTAGCG CCATCGTCGC CGCCGATTAT
TCGCCGGCAG AACGACGAAG CCTGCTGGAA CAGGCAGACT CCATGGACCC TTTTCCGGGA
AAGGATCTGC ACCCAGGAAC CAAGGAGAAC CAATGA
 
Protein sequence
MTWRGAYWMD ALRTQLWPVP TLAVALAVIL GTVLPALDAA VDGQLPESVA VFLFSGGPEA 
ARSVLQAISG SLITVTSLTF SLTVVTLQLA SSQFSPRLLR TFSSDRFVHG TLALFLAAFA
FALTVLRSVR GESDGTRLFV PEISVTVAFV LAIASVIGLV LFLAHLTREI RVETMMRRVN
VETQETIDRV FPGARPVPGP GPDPVSETFL INSTSSGFLN TFDKDGLMQA AENSGALIRI
DREPGSSLVE GVPFATAWPA EPGTALTPAV IEKLTDDVNA AVSTGFERTN VQDVGFGFRQ
LVDVAVRALS PGINDPTTAV HVIGHLSVLL CRLAERNPGP DHFTDKDGRV RVVVSLPKLK
DLLDMAMNQP RQYGASDPVV AGRLLALLQE LTWCDRKNQY RSEILDHLDR MRSAIVAADY
SPAERRSLLE QADSMDPFPG KDLHPGTKEN Q