Gene Apar_0831 details


Gene Information

Locus tag: Apar_0831
Symbol:
ID: 8413697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 918794
End bp: 920035
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 49%
IMG OID: 645022414
Product: NusA antitermination factor
Protein accession: YP_003179851
Protein GI: 257784634
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
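
The length fields in this record agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions and the stop codon is left out of the protein length. Both readings are assumptions, since the record does not state them. A minimal check in Python, with variable names chosen only for illustration:

```python
# Coordinates copied from the record above.
start_bp = 918794
end_bp = 920035

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1242
print(protein_length)  # 413
```

Under these assumptions the computed values match the reported Gene Length (1242 bp) and Protein Length (413 aa).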


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000351407
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000535814
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCATCTG AAATGATGGA AGCTTTGATG TTGCTTTGCC AGGAAAAGCA CATCGATGAG 
TTGTATTTAC TGGATCGTTT GGAGCAGTCT CTGGCAAAAA GCTATGCTGA TGTCCTCCAC
CTTGATTTTG GTGCTCGCGT CACTATTGAT AGGGCTACAG GCCGTGTATA CGTTTATGAG
CTTGTTCCTA AGGGCGAGCC AGATGAGGAG ACTGGTGAGT ACACCGAGTT TGATGAGGTA
GATGTAACTC CTCCTGATAC GAGCCGTATT GCTGCTCAGC ATGCTAAGGC AGAGATTAAG
ACGCTTGTTC GTAATGCTGC TCGTGCTCAG ATTTATGATG AGTTCCGTGG TCGCGTTGGT
GACATCATTA CTGGTACCGT TCTCCAGTCC ACTCCTGATT TCACCATCAT TAAGATTCGT
GAGGGTGTAG AGGCAGAGCT TCCACACTTT GACCAGCGTC GTTTCCCTGA CGAGCGTGAT
GAGCGTCCAG CAGGAGAGCG TTATCTACAC AATCAGCGCA TCAAGGCAAT TATTGTTGAC
GTTCGTGATC CTAATGCAAC TCAGCCAGCA GTTCGTGGTG AGCGCCAGCG TCCACCAATT
GTTGTCTCTC GTACCCACCC AGATCTTATC CGTCGTCTCT TTGAGCTTGA GGTCCCAGAG
GTTTATGACG GTGTAGTGAG CATTCGTTCT ATTGCTCGTG AGGCTGGCGT TCGTTCTAAG
ATTGCTGTTT CTTCCGTTGA TGAGCGTCTT GATCCTGTTG GTGCTTGTGT TGGTCCTAAG
GGCAGCCGTG TTCGTACCGT AGTTTCTGAG CTCCGCGGGG AGCGCGTTGA CGTTGTACCT
TGGTTTGATG ACGCTGCTCG TTGTGTTGCC TCCGCACTTT CACCTGCACG TGTTTCTCGC
GTTATTGTTG ATGGCGCAAC TGGTCACGCA ACCGTTATTG TTCCTGATGA TCAGCTATCT
TTGGCTATTG GTAAAGAGGG TCAGAATGCT CGTCTTGCTG CTCGTTTGAC TGGTCTTCAC
ATTGACATCA AGAATGAGTC CCTTGCTGCA AACATTTTGA ACAACCTTCC TGAGGTTGTT
GAAGAGGCTG TTGACGAGGA AGAGATTGCT CATCGTTGCA AGTATGTGAG CCCTAGCGGA
GTTCCTTGCC GCAATATGGC AAGACCTGGT TCTGATTTCT GCGGCATTCA TGATGCCATG
GAGAATGCAG AGATTTCTTC TGATTCAGAC TCATTGATTT AG
 
Protein sequence
MASEMMEALM LLCQEKHIDE LYLLDRLEQS LAKSYADVLH LDFGARVTID RATGRVYVYE 
LVPKGEPDEE TGEYTEFDEV DVTPPDTSRI AAQHAKAEIK TLVRNAARAQ IYDEFRGRVG
DIITGTVLQS TPDFTIIKIR EGVEAELPHF DQRRFPDERD ERPAGERYLH NQRIKAIIVD
VRDPNATQPA VRGERQRPPI VVSRTHPDLI RRLFELEVPE VYDGVVSIRS IAREAGVRSK
IAVSSVDERL DPVGACVGPK GSRVRTVVSE LRGERVDVVP WFDDAARCVA SALSPARVSR
VIVDGATGHA TVIVPDDQLS LAIGKEGQNA RLAARLTGLH IDIKNESLAA NILNNLPEVV
EEAVDEEEIA HRCKYVSPSG VPCRNMARPG SDFCGIHDAM ENAEISSDSD SLI
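
The gene and protein listings above can be cross-checked by translating the nucleotide sequence. The sketch below is an illustration, not the tool that produced this record. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two tables differ only in which codons may act as start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listed above.
first_block = "ATGGCATCTG AAATGATGGA AGCTTTGATG TTGCTTTGCC AGGAAAAGCA CATCGATGAG"
print(translate(first_block))  # MASEMMEALMLLCQEKHIDE
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein listing and the reported GC content.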