Gene Arth_1138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1138 
Symbol 
ID4446371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1233907 
End bp1235550 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content63% 
IMG OID639688944 
ProductABC transporter related 
Protein accessionYP_830632 
Protein GI116669699 
COG category[R] General function prediction only 
COG ID[COG3845] ABC-type uncharacterized transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTCAGAG TTTTGAAACT TGAACTCAGA GGGATCACCA AACGCTTCGG CTCCCTTATT 
GCCAACGATC ACATCGATGT GGTTGTTGAA CCCGGGCAAA TTCATTGTTT GCTGGGTGAA
AACGGCGCTG GCAAGTCCAC GCTGATGAAT GTCCTGTACG GCCTGTACGA GCCAAGTGAG
GGCGAAATCC TGGTGGACGG CAAGGCCGTC ACCTTCCGCG GCCCGGGTGA CGCGATGGCT
GCGGGCATCG GCATGGTGCA CCAGCACTTC ATGCTGGTTC CGGTCTTTAC AGTTGCGGAG
AACGTGGCCT TGGGCGCTGA AACCACCAAG GCCGGCGGTT TCCTGAACCT GGATGACACC
CGGCGCAAGA TCAAGGAGAT CTCCGACAAA TACGGTTTCG ACGTCGACCC CGATGCCCTG
ATCGAGGATC TTCCCGTTGG TGTCCAGCAG CGCGTGGAAA TCATCAAGGC CCTGGTCCGC
GACGCCAAGG TGCTCATCCT CGACGAGCCC ACCGCCGTGC TGACACCGCA GGACACCGAC
GAACTCCTGG ACATCATGCG CCAGCTGAAA AGCCACGGCA CCTCGATCGT CTTCATCTCC
CACAAACTCC GCGAAGTGAA GGCCGTTTCC GACACGATCA CGGTCATCCG GCGCGGAAAG
GTGGTGGGCA CGGCGGACCC CGGTGCTTCG ACCACCGAGC TTGCATCCAT GATGGTGGGC
CGCGCAGTCA ACCTGACCCT GGACAAGGCA CCGGCCAAGC CGCAGGAAAC CACGTTCCAG
GTCAAGGACC TCACCGTGAT CGCCCCGACA GGCCAGCATG TGGTGGACGG GATCAGCTTC
GACATTGCCC GCGGCGAAAT CCTGGCCATC GCGGGAGTGC AGGGCAATGG CCAGACCGAA
CTGACGGAAG CCATCCTTGG CCTGCAGGAC CGCGTCCACG GCTCCATCCT GCTTGACGGC
GAAGAACTCC TGGGCCGCAG CGTCAAGGAA GTACTCGGGG CCGGCGTCGG CTTTGTCCCG
GAGGACCGGA CCGTTGACGG GCTGATCGGC ACGTTCTCCA TCGCGGAGAA TCTGGTCCTG
GACCGCTACG ATCAGCCGCC ATTCGCCAAA GGCATCAGCA TGAGCCCTGC CCGGATCCTT
GAGAACGCCA AGTCCCGGAT CGACGAGTTC GACGTCCGGA CCCCGTCCGG GGCGCTGGCC
GCAGGGACCC TCTCCGGCGG AAACCAGCAA AAAGTGGTGA TGGCACGGGA GCTGTCCCGG
CCGCTGCGGC TGTTTATCGC CTCCCAGCCC ACCCGAGGCG TGGATGTAGG TTCCATTGAG
TTCCTGCACA AACGCATTGT TGCTGAACGG GACCAGGGAA CACCGGTGAT GATTGTGTCC
ACGGAACTCG ACGAAGTCAT CGAGCTCGCG GACCGGATCG CAGTGCTCTA CAAGGGCAAG
CTGGTGGGCA TCGTCCCTTC CGGCACCGGA CGCGACGTCC TGGGCCTGAT GATGGCGGGC
CTGTCCCCGG AAAGCGCCCA CGCGGATGCC GCCCAAACGG CCGTAGCCCA CGCGGACGCG
GCCCACGCGG ACGCAGGCGC GACCAGCGGG TACACACCCG TCCAGACGTC CACCTCCGGT
GCCGAAGGAG GCGACCATGA CTGA
 
Protein sequence
MVRVLKLELR GITKRFGSLI ANDHIDVVVE PGQIHCLLGE NGAGKSTLMN VLYGLYEPSE 
GEILVDGKAV TFRGPGDAMA AGIGMVHQHF MLVPVFTVAE NVALGAETTK AGGFLNLDDT
RRKIKEISDK YGFDVDPDAL IEDLPVGVQQ RVEIIKALVR DAKVLILDEP TAVLTPQDTD
ELLDIMRQLK SHGTSIVFIS HKLREVKAVS DTITVIRRGK VVGTADPGAS TTELASMMVG
RAVNLTLDKA PAKPQETTFQ VKDLTVIAPT GQHVVDGISF DIARGEILAI AGVQGNGQTE
LTEAILGLQD RVHGSILLDG EELLGRSVKE VLGAGVGFVP EDRTVDGLIG TFSIAENLVL
DRYDQPPFAK GISMSPARIL ENAKSRIDEF DVRTPSGALA AGTLSGGNQQ KVVMARELSR
PLRLFIASQP TRGVDVGSIE FLHKRIVAER DQGTPVMIVS TELDEVIELA DRIAVLYKGK
LVGIVPSGTG RDVLGLMMAG LSPESAHADA AQTAVAHADA AHADAGATSG YTPVQTSTSG
AEGGDHD