Gene Apar_1106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1106 
Symbol 
ID8413979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1251982 
End bp1253268 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content41% 
IMG OID645022695 
Producthypothetical protein 
Protein accessionYP_003180125 
Protein GI257784908 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1840] ABC-type Fe3+ transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTTTA TCCCTGATCG TCTGCGCCAG AAGATACATA AAACTTTTGG ACCAAAAGAC 
CCTGACGAGA AAAGCACTCC ACCAACTGCC ACTTCTGTTA CACGAACCTC ATTTGTTATC
CTGGGCATCT CGGTGGTTGC AGCGCCTGTG ATAGCTACCG TTGCAAATAT TTTGAGTCCA
GGAATTTCTT ATGTTCAGCA GGAATTTTTA CCTCCACAGC TTAATTCAAA GGTTCCAGGC
TATGCCCGTG CGCTGATTCA GCTTGCGCAA CGAGAAGGCT CTCTTAATCT TCTTTGTACT
TCCAAAGAAG CGGTAAAACC CTTGGTAGAA ACATATTCAA GAGCGTTTTC TGTGCCTGTT
ATTGTGGGTG AGGCAACAGA GGAAGAAATT CAAGAGTTTA TTCAGCAAAC ATATGTTGCT
AGTAAGAGTG TGGAGTCTGA AACTTCTGAT ATCAAATTGG ATGTTAATTG GGATTTTGAG
AAACCTGATG TCATTTTCTT AGCAAGCGAA GCTTTAATGC AATGCTTTGA AGCTTCCGCG
TATTGTCAGC CGTATGAAGT TTCTTCAACA GATCAGTACG TACGCTCCGA GTTTTTTGAA
GTCACAGGAC AATGGTATGC CTATGCTGCA GATCCGTTAG TTTTACTTGT GAATCAAGAG
CGTCTTAATG AGAAAAAGAT TCAGCGTCCA CGTGAATGGA CAGATCTTCT TATAGACGGT
CTGCGTGGAA GGTATGTTAT TCCTGATCCT GCACTTACGT ATGTGGGCAA GAAGTTTGAG
TCGCTCTTTT TGGGTCAGTA TGGTCTGGAT AAAGGTACTC AAATTATTCA AAGTTTGTTC
GAAAATGTTG ACGCATTTTC TGAGAGTACA TATCAAGCAA TTAGAGACAC AGGATTTGGT
AAGTACAGAG CCTGTGTTTG TTCACTAAGT GATGCGTATA GGGCAATTAC TAAAGATGAT
TTTGATAATC TCTCAGTTGT TCTGCCGGGT GCTTCTTATT TCTCTACGAT AAGGTCATTT
ATATGTCAGA GGTCAAAACA TACGTATTCA GCATATTTAT GGCAGGAATT TATTACCAAA
CGGGATTCTG CTGAACACAT GGGAGAAATG GACTCGTTTT TCTCGCCAGT TATTGAGGGA
ACGCCAAATC CTTGGTATGC AAGTCGTCTA AATTTGACAG GGGTAAGTGA ATCAATGCAG
ATAGTTCCTG CTCCAACAGA TGCAGAAGGC AATCTGATCA AGCTTGAAGA TGTTCATGCT
GTATACAGTA AGTTGGTTAA AGCGTAG
 
Protein sequence
MKFIPDRLRQ KIHKTFGPKD PDEKSTPPTA TSVTRTSFVI LGISVVAAPV IATVANILSP 
GISYVQQEFL PPQLNSKVPG YARALIQLAQ REGSLNLLCT SKEAVKPLVE TYSRAFSVPV
IVGEATEEEI QEFIQQTYVA SKSVESETSD IKLDVNWDFE KPDVIFLASE ALMQCFEASA
YCQPYEVSST DQYVRSEFFE VTGQWYAYAA DPLVLLVNQE RLNEKKIQRP REWTDLLIDG
LRGRYVIPDP ALTYVGKKFE SLFLGQYGLD KGTQIIQSLF ENVDAFSEST YQAIRDTGFG
KYRACVCSLS DAYRAITKDD FDNLSVVLPG ASYFSTIRSF ICQRSKHTYS AYLWQEFITK
RDSAEHMGEM DSFFSPVIEG TPNPWYASRL NLTGVSESMQ IVPAPTDAEG NLIKLEDVHA
VYSKLVKA