Gene Arth_0846 details


Gene Information

Locus tag: Arth_0846
Symbol:
ID: 4446649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 914474
End bp: 915997
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 65%
IMG OID: 639688653
Product: Na+/solute symporter
Protein accession: YP_830344
Protein GI: 116669411
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
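
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only illustrative: the variable names are not part of the record, and it assumes the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 914474
end_bp = 915997

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1524 0 507
```

Under those assumptions the computed values agree with the reported Gene Length (1524 bp) and Protein Length (507 aa).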


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGCAA ACTTCGTCAA CATCGCCATC GTGGTGGTGT ACCTGATCGC CATGCTGGCC 
TTCGGCTGGT GGGGCAAATC CCGCACCAAG AACAACAGCG ACTTCCTGGT GGCCGGCCGC
AGGCTTGGCC CCTTCCTCTA TACCGGGACC ATGGCGGCCG TTGTCCTCGG CGGCGCCTCA
ACGGTGGGCG GTGTGGGCCT CGGCTACAAG TTCGGCATCT CCGGCATGTG GCTCGTGGTG
GCCATCGGGT CCGGAGTGCT CCTGCTGAGC CTGCTCTTCG CCGGCACCAT CCAGAAGCTG
AAGATCTACA CGGTGTCCCA AATGCTGACG CTCCGATACG GCAGCCGCGC CACCCAGACC
TCAGGAATCG TGATGCTGGC CTACACGCTG ATGCTCTGCG CCACCTCCAC CGGGGCCTAC
GCCACCATCT TTGTGGTGCT GTTCGGCTGG GACCGCGCCC TCGCCATCGC CGTTGGCGGG
GCGATCGTCC TGGTGTACTC CACCATTGGC GGCATGTGGT CCATCACCCT TGCGGACCAG
GTCCAGTTCG TCATCAAGAC GGTGGGGATC TTCCTCCTGA TGCTCCCCTT CACGCTTAAT
GCAGCCGGAG GCCTGGACGG CATCCGCAGC CGCGTCGAGG ACAGCTTCTT CCAGATCGAC
GGCATCGGGA TCCAGACCAT CATCACGTAC TTCGTGGTCT ACACCCTCGG CCTCCTGATC
GGCCAGGACA TCTGGCAGCG CGTCTTCACC GCCAAGACGC CCACCGTGGC ACGCTGGGGC
GGCGCAACGG CCGGCATCTA CTGCATCCTT TACGGTGCGG CCGGCGCCCT GATCGGCCTG
GGTGCGCGAG TGGCCCTCCC GGAGATCGAC GTCGCAAACC TCGGCAAGGA CGTTGTCTAT
GCCGAGGTGG CCCAGAACCT GCTGCCCGTC GGCATCGGCG GACTGGTGCT CGCAGCAGCC
GTAGCGGCCA TGATGTCCAC CGCCTCCGGC GCCCTGATCG CGGCGGCAAC CGTGGCCCGT
GCCGACGTCC TTCCGTTCGT TGCCAGCTGG TTCGGCAAGG ACATCAACAC CGATGACACC
GACAACCCCG AGCACGACGT CAAGGCGAAC CGCATGTGGG TCCTTGGCCT TGGCATCGTG
GCCATCCTCA TCGCCATCAT CACCAAGGAC GTCGTGGCAG CCCTGACCAT CGCCTACGAC
ATCCTGGTGG GCGGACTCCT GGTCGCGATC CTTGGCGGAC TCGTCTGGAA ACGGGGCACG
GGCGTGGCCG CGGCGGCATC CATGGCGGTA GGCACGGTGG TAACACTCGG CACCATGATC
TACCTGGAGA TCAATGCCGC GGCGCCGCTG GACGGCATCT ACGCCAATGA GCCGATCTAT
TACGGCCTGC TGGCGTCAGG CATCGTCTAC GTGGTGGTGT CCGTGGCAAC CAAGCCCACC
GACCCCCGGG TCATGCGGAA CTGGCAGGAG CGCGTCGCCG GCAACGTCGA CGAAGAAGAG
CCGGCTCCGG CTCTGGTCAA CTAA
 
Protein sequence
MDANFVNIAI VVVYLIAMLA FGWWGKSRTK NNSDFLVAGR RLGPFLYTGT MAAVVLGGAS 
TVGGVGLGYK FGISGMWLVV AIGSGVLLLS LLFAGTIQKL KIYTVSQMLT LRYGSRATQT
SGIVMLAYTL MLCATSTGAY ATIFVVLFGW DRALAIAVGG AIVLVYSTIG GMWSITLADQ
VQFVIKTVGI FLLMLPFTLN AAGGLDGIRS RVEDSFFQID GIGIQTIITY FVVYTLGLLI
GQDIWQRVFT AKTPTVARWG GATAGIYCIL YGAAGALIGL GARVALPEID VANLGKDVVY
AEVAQNLLPV GIGGLVLAAA VAAMMSTASG ALIAAATVAR ADVLPFVASW FGKDINTDDT
DNPEHDVKAN RMWVLGLGIV AILIAIITKD VVAALTIAYD ILVGGLLVAI LGGLVWKRGT
GVAAAASMAV GTVVTLGTMI YLEINAAAPL DGIYANEPIY YGLLASGIVY VVVSVATKPT
DPRVMRNWQE RVAGNVDEEE PAPALVN
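
The sequences are printed in space-separated blocks of ten, so they need the whitespace removed before any analysis. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments are the same as those of translation table 11. Only the first line of the gene sequence is used here to keep the example short.

```python
# Standard genetic code in TCAG order. Translation table 11 uses the same
# codon-to-amino-acid assignments (it differs only in permitted start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGGACGCAA ACTTCGTCAA CATCGCCATC GTGGTGGTGT ACCTGATCGC CATGCTGGCC"
print(translate(first_line))  # MDANFVNIAIVVVYLIAMLA
print(f"{gc_content(first_line):.0%}")
```

The translated fragment matches the first two blocks of the protein sequence. Passing the full gene sequence to the same functions would allow the complete protein and the reported GC content to be compared in the same way.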