Gene Apar_1181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1181 
Symbol 
ID8414059 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1321621 
End bp1323102 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content50% 
IMG OID645022775 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_003180200 
Protein GI257784983 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00250228 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000333718 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTGTCT ATAACACTCA GACGCATAAA AAAGAAGAGT TTAAGCCAAT CGATGAAGGC 
AAAATCCGCA TGTACGTATG CGGTCCTACG GTCTACGACC AGATTCATAT CGGAAATGCC
CGTACGTTCC TCTCGTTTGA CGTTATTCGA CGTTATCTGA TGTATAAAGG TTTTGAGGTT
ACGTTTGCTC AGAATCTGAC CGATGTTGAT GACAAGATTA TTCAGCGCGC CAACGAGCAG
GGAAGGACTG CTCAAGAGGT CTCTGAAGAG TTTTCTCAGG CGTTTATCAA GCAGATGCAT
CGCTTCAATA TACTGGATCC AGATATTCGT CCTCGCGCAA CGCATGAGAT TGAAGCCATG
CTCCAGATGA TTCAATCTTT GATTGAGCAG GGTTACGCTT ACGCGGTGCC TTCGGGCGAC
GTGTATTTCT CGGTTCGTTC TGACCACGAC TATGGTGTGC TTTCTGGCCG CGATCTTGAT
CAGCTTCGCG CAGGTGAGCG TGTTGAGGTT AATGACGAGA AACGCGATCC ATTTGACTTT
GCACTCTGGA AGGCTGCTAA GCCTGGTGAG CCTAGCTGGT CAAGCCCATG GGGAGAAGGT
CGTCCAGGTT GGCACAGTGA GTGCTGTGCG ATGATTCATC GCTATTTAGG CACTCCAATT
GATATTCATG GTGGTGGCTC TGACCTGGTA TTCCCTCACC ATGAGAACGA GACCGCACAG
GCATCTTGCG CCTGGCATGC TCCTCTCGCT AATTATTGGA TGCACACCGG AATGCTGCGT
GTTGACGGTG AGAAGATGTC TAAGTCTTTG GGTAACTTCT ATACGCTTAA GGAAGTTTTG
GACAAGTATC CTGCAGATGC AGTACGACTT TTGATGCTTC AAACGCACTA TCGCGCGCCA
CTTGATTTCT CGTTTGAACG CCTTGAAGGA ACCGTTGGCA CACTCGAGCG CATGAAGACC
TGCGTTGCAA ATCTTCGTTG GGCCTACAAG CAGTCTTCTG CTCAGCACGA GCTTTCTGAC
GCTGATCGTA CGCTTGCTGA GGCAATTAAC ATAGCCCAGA GCGAGTTTGA CGCTCAGATG
GATGATGATT TCAATACCGC AGGAGCGCTT GCTGCCATCT TTGCGCTGGT AACTGCTGCA
AATACCTATC TTGCAGAGGT TGGGCAGAAC GTTGGCGCAA GCCCTGTTCT CCGTGCATCA
GATATGTTGT GCGAGCTCAC AGGAGCCTTG GGAATTGATT TGACCCAGGC GGCGTCTACT
TCTGATTTAC CAGAAGAGCT TGTAGCACTT GCCGCCCAGG TAGCAGCCTA TGAAGGCTCT
TCAGCTGACG AGGCCGCAGA GGTTTTGCTT GCTGCTCGTC AGGAAGCTCG TTCCCAGAAG
AACTGGGCAG TTGCCGATCA AATTAGAGAC GGCATTGCCG AGCTTGGACT TTTGATTGAA
GATACCGCCG CTGGCGCTCG ACTTAAGCGT AAGGCAGACT AA
 
Protein sequence
MLVYNTQTHK KEEFKPIDEG KIRMYVCGPT VYDQIHIGNA RTFLSFDVIR RYLMYKGFEV 
TFAQNLTDVD DKIIQRANEQ GRTAQEVSEE FSQAFIKQMH RFNILDPDIR PRATHEIEAM
LQMIQSLIEQ GYAYAVPSGD VYFSVRSDHD YGVLSGRDLD QLRAGERVEV NDEKRDPFDF
ALWKAAKPGE PSWSSPWGEG RPGWHSECCA MIHRYLGTPI DIHGGGSDLV FPHHENETAQ
ASCAWHAPLA NYWMHTGMLR VDGEKMSKSL GNFYTLKEVL DKYPADAVRL LMLQTHYRAP
LDFSFERLEG TVGTLERMKT CVANLRWAYK QSSAQHELSD ADRTLAEAIN IAQSEFDAQM
DDDFNTAGAL AAIFALVTAA NTYLAEVGQN VGASPVLRAS DMLCELTGAL GIDLTQAAST
SDLPEELVAL AAQVAAYEGS SADEAAEVLL AARQEARSQK NWAVADQIRD GIAELGLLIE
DTAAGARLKR KAD