Gene Apar_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1101 
Symbol 
ID8413974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1243687 
End bp1245063 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content44% 
IMG OID645022690 
ProductNa+/H+ antiporter NhaA 
Protein accessionYP_003180120 
Protein GI257784903 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3004] Na+/H+ antiporter 
TIGRFAM ID[TIGR00773] Na+/H+ antiporter NhaA 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTAGTA TTTATAGAGA GCCCGCTGTA GTTAGAAGAA TTGGTCAACG CTCTTCTCTA 
AAGAAAATTA CTTCTAACAG CACTATTGCG GCAGCAGTTA TGGTGCTTGC AGCGCTTATT
GCACTTGTAG TTGCCAACTC TCCCGCTCAT GAGGTGGTTC GAGAAATTCT AGAAGTTCAA
ATGAGTTTCT CGCTTGGAAT AATCCATGCT CACATGGCGC TAGAGACTTT TGTGAATGAT
TTTCTTATGG CAATCTTCTT CTTGTTGGTG GGTATTGAGC TCAAGTATGA AGTAACTGTT
GGTCAGCTCA GAAAGCCAAG ACAGGCAATG CTTCCTATGC TTGCTGCTGT TGGTGGCGTT
GCAGTTCCTG CCCTTATTTA TTTGGTGCTT AATGCAAGCA CTGCTCCTCA TGGATGGGCA
ACACCAATTG CAACTGATAT TGCCTTTGCG CTTGGTGTTA TGTCACTTTT AGGAAACAGA
ATCAGTCCCC AGACCAAGGT TTTCTTCCAG ACACTTGCTA TTGCTGACGA TATCCTAGCA
ATTGTGGTTA TCGCGCTCTT TTATGGTCAG ACACCAAATA TTGCCTGGTT AGCGGCTTCT
CTTGGTGTAC TTGTTATTTT GTGGGGAATG AATCGTCTAA AGATTTATTC TTCCGGTCCG
TACCTGCTTG TAGGTTTGGT TCTTTGGGTT TGCATGTATC ACTCGGGAAT TCACGCCACT
CTTGCAGGTG TCCTTCTTGC CTTCTTCTTG CCATCTCACT CTGATGTTCG CTTGAGTAAG
TTAGGAAGCT GGCTTTCAAG GCGCGCTCGT GAGCTGGATG ATCATTATGA TGATGAGCAT
CACGTTTTGG GCCAGCACGA CTTTACACAT GGTGCCAACT CAATTGAGCA TGTCATGCAT
CATGTAACTC CACCGCTTCA GCGCGTTGAG CATTATATTT CTGTCTTTGT TAACTTTATT
GTTCTGCCTA TCTTTGCGTT TGTAAACGCA CAGATTACGC TGGTAGGAGC AGATTTTGGT
GCTCTGTTGA GCAGTACTAT TGCACACGGC GTGTTCTTTG GTGCTGTTCT TGGAAAGCCA
CTAGGAATCA TTTTTGTAAC GTTCCTGCTG GTCAAATGTA AGGTTTGCAA ACTTCCTGCA
AAAGTTGATT GGGTTCAGAT TACAGCCGTT GGCCTTATGG GTGGCCTTGG CTTTACCATG
TCAATTCTTA TCTCTAATCT TGCATTCCCA GATGCAGGGG AGATTCTTGC AGCTAAAGTG
GCTGTTCTCG CCGCATCGTT TGCTTCTGCA ATCTTGGGTC TACTATTTGT GCACATCACA
GAGTTTTATA AACACCAAAA GAAGAACAGC TTGAGGGAAG ATGTCGTACA AGAATAA
 
Protein sequence
MSSIYREPAV VRRIGQRSSL KKITSNSTIA AAVMVLAALI ALVVANSPAH EVVREILEVQ 
MSFSLGIIHA HMALETFVND FLMAIFFLLV GIELKYEVTV GQLRKPRQAM LPMLAAVGGV
AVPALIYLVL NASTAPHGWA TPIATDIAFA LGVMSLLGNR ISPQTKVFFQ TLAIADDILA
IVVIALFYGQ TPNIAWLAAS LGVLVILWGM NRLKIYSSGP YLLVGLVLWV CMYHSGIHAT
LAGVLLAFFL PSHSDVRLSK LGSWLSRRAR ELDDHYDDEH HVLGQHDFTH GANSIEHVMH
HVTPPLQRVE HYISVFVNFI VLPIFAFVNA QITLVGADFG ALLSSTIAHG VFFGAVLGKP
LGIIFVTFLL VKCKVCKLPA KVDWVQITAV GLMGGLGFTM SILISNLAFP DAGEILAAKV
AVLAASFASA ILGLLFVHIT EFYKHQKKNS LREDVVQE