Gene Noca_3111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3111 
Symbol 
ID4597896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3313253 
End bp3314485 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content73% 
IMG OID639777717 
Productarsenite-transporting ATPase 
Protein accessionYP_924300 
Protein GI119717335 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0003] Oxyanion-translocating ATPase 
TIGRFAM ID[TIGR00345] arsenite-activated ATPase (arsA) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGTATCC TCCTGTTCAC CGGAAAGGGC GGCGTCGGCA AGTCCACGGT CGCCGCCGGG 
ACGGCCGCGC TCGCGGCCGC CGACGGGCAC CGCACGCTGG TCCTCTCGAC CGACGCGGCC
CACTCCCTGG CGGACGCCTA CGGCTGCGAG TACGGCGCGA TCGGGCCAGA GGCGACCGAG
GTCGCACCGG GGCTGTTCGT GGTACAGGTC GACGCGCAGC TGCGGTTCGA GCAGTCCTGG
GCCGACATCC AGCGCTACCT GCTCTCGGTG CTCGATGTCG CGGGGGTCGA TCCCGTCGCG
GCGGAGGAGC TGACCGTGAT CCCGGGAGCC GAGGAGGTGC TGGCGCTGCT CGAGCTCCGC
CTGCACGCTC TCTCCGGCGC CTGGGACGTG ATCGTCGTCG ACTGTGCCCC GACGGCCGAG
ACCCTCCGGC TGCTCGCCCT CCCCGAGGCG CTCGGCTGGT ACATGAACCG GGTCTTCCCG
GTCGAGCGGC GCGTCGTCAA GGCGCTGCGG CCGGTGCTGA GCCGGGCCGC CGGGGTCCCG
ATGCCCGGCG ACTCCGTCTT CGACGCGATC GAGCGGCTGC ACGCCGAGCT CGACGAGGTG
CGCACGCTGC TCAGCGGCCC CGACTCGAGC GTGCGGCTGG TGCTGACTCC CGAGAACGTG
GTGCTCGCCG AGGCCCGGCG CTCCTACACG ACGTTGTCGC TGTTCGGCTA CCGCGTCGAC
GGCGTCGTCG CCAACCGGGT CTTCCCCGCC GAGGACGCCG ACGACTGGCG GGCCGGCTGG
GTGCTCGCCC AGGACGAGGT GCTCCGCCGG GTCGAGCAGT CGTTCGCCGG CCTGCCGATC
TGGCGCTCGG AGTACCGCTC CCGTGAGCCG GTGGGCGTCG TGCCCCTGGC CGGCCTCGCC
CGCGATCTGT ACGGCGACGA CGACCCGCTG ACCAGCTCGC CCGGCAAGGC CCCGTTCCGC
ATCCGGCGCA GCGAGGGTGG CGCCGTCGTC CGGCTGGCCC TCCCGTTGGT GTCCCGCACG
GACGTCAACC TGGCCCGCAA TGGCGACGAT CTCGTCGTGA CCGTGGGATC GTATCGACGG
TTGATCACGC TCCCATCGGG CCTGGTGCGG TTCCGTATCG CCGGGGCCCG GGTGGAGCAC
GGGGAGCTGC AGGTGCGGTT CGTCGAGGAT GCCGACACCG CCAGCGCCGC CGGCACGGCG
GTCGGCGCGG CTGCGGACGA AGCTGGGAGA TGA
 
Protein sequence
MRILLFTGKG GVGKSTVAAG TAALAAADGH RTLVLSTDAA HSLADAYGCE YGAIGPEATE 
VAPGLFVVQV DAQLRFEQSW ADIQRYLLSV LDVAGVDPVA AEELTVIPGA EEVLALLELR
LHALSGAWDV IVVDCAPTAE TLRLLALPEA LGWYMNRVFP VERRVVKALR PVLSRAAGVP
MPGDSVFDAI ERLHAELDEV RTLLSGPDSS VRLVLTPENV VLAEARRSYT TLSLFGYRVD
GVVANRVFPA EDADDWRAGW VLAQDEVLRR VEQSFAGLPI WRSEYRSREP VGVVPLAGLA
RDLYGDDDPL TSSPGKAPFR IRRSEGGAVV RLALPLVSRT DVNLARNGDD LVVTVGSYRR
LITLPSGLVR FRIAGARVEH GELQVRFVED ADTASAAGTA VGAAADEAGR