Gene Apar_0947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0947 
Symbol 
ID8413818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1066327 
End bp1068411 
Gene Length2085 bp 
Protein Length694 aa 
Translation table11 
GC content44% 
IMG OID645022535 
ProductTetratricopeptide domain protein 
Protein accessionYP_003179967 
Protein GI257784750 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGCGTCT CAGATCTTTT CTCGCTAAAT AAAGCTTCAA AGCAATCTCA GGGATCTACC 
CTAGAGAAGA TTTTGCTGCG CTCAAATAAT GTCATTGCCT CTCTTAAAGA CCTTGTTGCA
AACAATCAGA TTACAGAGAC GGGTCTTCAA GAGATGTTGA TGCGCTCTAA AAATCTCGAG
AATACCAATC TGCCTAAGGG CGATGTTCAT AAACTTGATA GAACGGGTCG CTGGTGGTTT
AATGCCAACA CACTGGAATG TGCGGCTGAA GAGTACGATG CTTTTATTGC AACTGAAGCA
ATCCTCAATA TTTCACAAGA TGTTGAGGCT CTAAAAAATA CTCACGCTTC TGAAACGGAT
CTCCAGCTTG TTCGTACTAC ACTCTCGCAG GTAGCAAGTA TCATGCCCAA GTCATCCTCC
CTTACTGAGC AAGTTGCTCC TTTTAGTGGT GGTGCAGACG GCAGCTCTGA CTGGATGAAA
CGTCTCTGGT TTGCCAACTA TGTTGAGAAC ACACCATTTC CCTTTAGGAT GGTGTATAAC
TTCTCTTACA ACCCCCAGCT TGATATTTTG GTATTTGAAT TCTTTGTAGC TCGCCCTCGC
TGCTTTAGTT TCTTATCTGC AGAAAAGTCA GAGCAAATTG CAGCAGCACG TGCTTATGCG
CTGCGAACTT CTCTTTGCGT TGCTCGTATG GCGCTTCAAA GCTGTAAAAT CTCTCGTGTC
TGCATCAATG GCAGCCTACG CGGAGAAGAC CGTATTGTCA TATCGATGGA CTTAAACGAG
GCTGCACTTG CTCGCCTCTT ACCTACTGCA GCAAATACTC AAATTGACAG TAACAGCTTT
CCACAAGATC CTGCGCTTAG AGTTTCTTTT GACACTGAAG GTTGGTTCAA CGAGGTTGAA
CCCTTCATGA AGCCAACTGA TGAGTGGGTT TCTCCTCGTA GCTTCTTTGA GGTACCAGAC
TTGTCTGATC GTCCTTGTTC GGCGGCGGTT ACCGCTATTT GTGGAGCTCA AAAGGTAAAT
GATCTTGGGT ACTCTGAAGC AGCTCATCGC ATTAAGCTTT GGAATACCAC CCTCAACAAC
ATCCCAAAAG ATGCTTCAAC TGCAGATGTA GTTAGCCAGC TTGAGGAAGC AAAAGCTTCA
ACTTCTGATA TTTATGCCAT TGAAGGTCTT GATAGAGTCA TTCACGGCCT GGTTGAAGGG
ACTATTGATT TTTCAGATAG AAGAACCATG GCCGAGAAAT TCCTCTTTGG CTCGCCTCTC
AACAAAACAC TTGAGACCAT AAAGAACATC ATGGATGGAG AGCCAGATCC AGATGCTCTA
GAAAAAACAC TCACAGAGCT TGAATCGCAG GTTTCACCAA CCCTTGATAT GGGCTTATAC
CTGGACGATT CGGATTCTAT CTACAGATAT TTTGATTCCA TCTCAGAGCG CATAGCCTAT
AACCTTGCGT TCCCCAATGA GCCAAGAAAA CTTGTACTTA TTCCAGATAC GTACTTCATG
TCTTTGGCAA GAATGGCACG CGCCTATAAC CTGCTTGAGC AGTCAGAAAA AGCAGAACGC
TACGCGCAAG AAGCCCACAG GATTTCTCCT TTGGGAATTG ATGCAACATT GTTGCTAGTA
AGAACACTGG AAGATCAGTC AAAAATTTTT GAGGCTGCAA AGCTTCTCAA AAATCTTATC
CAACATCTTT TCTCCAGTTC AGATGTTGCA CTGGTGTACT ATCGCCTGGC ATATATGGAG
TGGAAACTGG GACGCTCTGA TCTTTGCGCA GCTTGCTATC AAATGGCCAT AATTATCGGT
GGTAATGTTG CTCAGCCTGC AAAAGAAGAG CTTAAAGACC TTCTAAAGAC AGATTCTTCT
GTCAAAACGC TAGACACCCC TCAAGAGGTC TTCTCGTTCC TTGAGCAAAA TAATATTCCT
GTCTTTGACC AGAAAGCTGT ATTTAACAAG GCAGTTACTA TTGCCAGTGC CTGTGTTAAC
GATGGCGTTT ATTGTGTTGG TCAGAATATG CTCAAGAACT GCCTTGAGAT TACGTTTGAT
GATGCAGCTG CAAAAGTTGA GAGTTCACTG AGATCACCCT ACTAA
 
Protein sequence
MGVSDLFSLN KASKQSQGST LEKILLRSNN VIASLKDLVA NNQITETGLQ EMLMRSKNLE 
NTNLPKGDVH KLDRTGRWWF NANTLECAAE EYDAFIATEA ILNISQDVEA LKNTHASETD
LQLVRTTLSQ VASIMPKSSS LTEQVAPFSG GADGSSDWMK RLWFANYVEN TPFPFRMVYN
FSYNPQLDIL VFEFFVARPR CFSFLSAEKS EQIAAARAYA LRTSLCVARM ALQSCKISRV
CINGSLRGED RIVISMDLNE AALARLLPTA ANTQIDSNSF PQDPALRVSF DTEGWFNEVE
PFMKPTDEWV SPRSFFEVPD LSDRPCSAAV TAICGAQKVN DLGYSEAAHR IKLWNTTLNN
IPKDASTADV VSQLEEAKAS TSDIYAIEGL DRVIHGLVEG TIDFSDRRTM AEKFLFGSPL
NKTLETIKNI MDGEPDPDAL EKTLTELESQ VSPTLDMGLY LDDSDSIYRY FDSISERIAY
NLAFPNEPRK LVLIPDTYFM SLARMARAYN LLEQSEKAER YAQEAHRISP LGIDATLLLV
RTLEDQSKIF EAAKLLKNLI QHLFSSSDVA LVYYRLAYME WKLGRSDLCA ACYQMAIIIG
GNVAQPAKEE LKDLLKTDSS VKTLDTPQEV FSFLEQNNIP VFDQKAVFNK AVTIASACVN
DGVYCVGQNM LKNCLEITFD DAAAKVESSL RSPY