Gene Arth_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3043 
Symbol 
ID4444307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3410743 
End bp3412314 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content68% 
IMG OID639690867 
ProductNa+/H+ antiporter 
Protein accessionYP_832522 
Protein GI116671589 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0025] NhaP-type Na+/H+ and K+/H+ antiporters 
TIGRFAM ID[TIGR00831] Na+/H+ antiporter, bacterial form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00823148 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCAGC TGGCGCTGAT TATCGGACTC CTGCTTGCCA CAGTCGTGGC GGTGGGTCTG 
GGGGATAGGC TCCGGCTGCC CTATCCCGTG CTCATGCTGA TCCTGGCGGC ATCCCTGACG
TTCATCCCCG GCTTTCCGGC GATGGATATC TCACCGGAGC TGATCTTGCC CATCTTCCTG
CCGCCGCTGC TGTTCGCGAC GGCCCAGCGC AGTTCCTGGG CCGTGTTCCG GGTCCGCTGG
CGCACGCTGG TCATGCTCGC CGTGGCCCTG GTGGTGGTAT CCACGGCAGC AGTCGCCGGG
GCGGCCTGGC TCATGATCCC GGGAATCGGG ATCCCCGCCG CCATCGCCCT GGGCGCCATG
GTGGCTCCGC CGGACCCCGT GGCCGTGGAG TCCGTGGCCG GCCGGGTGCA TATGCCGCGG
CGGCTCATTA CGGTCCTGCA GAGCGAGGGC CTCTTCAACG ATGCCGCCGC TATCGTCATC
TTCCAGGCTG CCGTGGCGGC GGCCGTCTCC GGCAGCAGGA TAGGGCCCGA CGTCGTCCTC
AAGTTCCTCG TGGGGGCTGC GGTTGCGGTG CTGGTGGGCG TCGCCATGGG CTGGGTCACG
GCGCTGATCA CCCGGTTGGT GACCTCCATG GTGGCCCGCA GCGCGGTCAC CCTGGTTGTA
CCGTTTGCCG CCTACATCCT CGCCGAGGAA GTGCACGCCT CAGGCGTGAT CGCCGTCGTC
GTTACCGCCC TGGAGATGCA GCGCCACTCG CGTCCGCAGG ACGCCGCCGA ACGGGTCACC
CGGACGGCCT TCTGGGACGT GGTGGAACTT CTGGTGACCG GCCTGGCCTT CGGGCTGGTG
GGGCTGGAAA TCCGCCAGGT CATCCACGAT GAAGGCACTG AGATCTACGG CATGATCGGC
ACCGCCGTGG TGGTGTGCGT CATCGTGTTC GCGGTCCGGT TCCTGTGGCT CGGCCTCCTG
GCGGCCTCGG CACGCAAACG GGAGAACCTG CTGCAGCCCA CCTCGGCCAA GGAAGTCCTG
ATCCTGACCT GGTGCGGGAT GCGCGGCCTC GCCACCCTGG CGCTTGCGCT GGCACTGCCC
CTCACGCTGG ATGACGGAAC GCCGTTCCCG GCGCGCGACC ACCTGCTGGT CATCGCCTGC
GCGGTGCTGC TGGCCACGCT GGTGCTGCCG GGCCTGACAC TGCCCTGGCT GATGAAGGTC
CTTAAAGCCA CCGGAGACGG TTCGGAGGAA CGCGACGCCG CCCGGGTGCT CGCCAAGCGG
GCGCAGCAGG CTGCCGTCGC CGCGCTGAAG GACAACGACC TCATGAAAGA GCTCCCGCCG
GAGAAGGTGG CGCTGGTCAA AGAGAAGATG ACGCGCCTGC ATGCGGAGCT CCTAGACGGA
AGCCTGAAGA ATGAATCGGT GGGGGAGAAA CGCAAGCGCG GCCGCGAACT CGCCATTGCC
GTGCAGACCA TCGCGCTCGA CGCCGCCCGC CAGGAAGTCG TGGCGGCCCG GAACGAACCG
GACATGGACC CGGAAGTCGC GGACCGGGTG CTCCGCCAGC TGGACCTCCG CACCATGATC
ATGCCGGAGT AG
 
Protein sequence
MDQLALIIGL LLATVVAVGL GDRLRLPYPV LMLILAASLT FIPGFPAMDI SPELILPIFL 
PPLLFATAQR SSWAVFRVRW RTLVMLAVAL VVVSTAAVAG AAWLMIPGIG IPAAIALGAM
VAPPDPVAVE SVAGRVHMPR RLITVLQSEG LFNDAAAIVI FQAAVAAAVS GSRIGPDVVL
KFLVGAAVAV LVGVAMGWVT ALITRLVTSM VARSAVTLVV PFAAYILAEE VHASGVIAVV
VTALEMQRHS RPQDAAERVT RTAFWDVVEL LVTGLAFGLV GLEIRQVIHD EGTEIYGMIG
TAVVVCVIVF AVRFLWLGLL AASARKRENL LQPTSAKEVL ILTWCGMRGL ATLALALALP
LTLDDGTPFP ARDHLLVIAC AVLLATLVLP GLTLPWLMKV LKATGDGSEE RDAARVLAKR
AQQAAVAALK DNDLMKELPP EKVALVKEKM TRLHAELLDG SLKNESVGEK RKRGRELAIA
VQTIALDAAR QEVVAARNEP DMDPEVADRV LRQLDLRTMI MPE