Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_3626 |
Symbol | |
ID | 5455837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 3881572 |
End bp | 3883182 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640879210 |
Product | NusA antitermination factor |
Protein accession | YP_001414881 |
Protein GI | 154254057 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.475474 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTAA GCGCCAACCG GCTCGAACTG TTGCAGATCG CGGATGCGGT CGCGCGCGAA AAGTCGATCG ACCGCGAAGT CGTCATCGAG GCGATGGCGG AAGCGATCCA GAAGGCCGCG CGTTCCCGCT ACGGCGCGGA AAATGAAATC CGTGCACGGA TCAATCCGCA GACCGGTGAA ATTCGCCTCG TGCGCCTCCT CGAAGTGGTG GACACAGTCG AAAACGACGC CGTCCAGATC GACCTGAAAT CCGCGCAGCG CCGCAATCCG GCCGCGCAGC TCGGCGATCT CATCGAGGAC GAGCTGCCGC CGGTCGATTT CGGCCGCATC GCCGCGCAGA CCGCCAAGCA GGTCATCGTG CAGAAGGTGC GCGACGCCGA GCGCGAGCGC CAGTACGACG AATACAAGGA TCGCATCGGC GAGATCGTCA ACGGCATCGT CAAGCGCGTC GAATACGGCA ACGTCATCGT CGATCTCGGT CGCGGCGAAG CCATTGTCCG CCGCGACGAG CTGCTTCCGC GCGAAACCTT CCGCAATGGC GATCGCGTCC GCGCCTATAT CTATGACGTG CGCCGCGAAC AGCGCGGCCC GCAGATTTTC CTCTCGCGCT CGCATCCCCA GTTCATGGCG AAGCTCTTCG CGCAGGAAGT GCCGGAAATC TATGATGGCA TCATCGAGAT CAAGGCCGTC GCCCGCGATC CGGGCAGCCG TGCGAAGATC GCCGTTCTCT CGAACGACTC GTCGATCGAT CCCGTCGGCG CCTGCGTCGG TATGCGCGGC TCCCGCGTGC AGGCCGTGGT GAACGAATTG CAGGGCGAAA AGATCGACAT CATCCAGTGG TCGCCCGATG CCGCCACCTT CATCGTCAAT GGCCTCGCGC CGGCCGAAGT CGTCAAGGTC GTGCTGGACG AAGATGCACA GCGCATCGAA GTCGTCGTGC CGGACGATCA GCTCTCGCTA GCCATTGGCC GCCGTGGCCA GAACGTGCGC CTCGCCTCGC AGCTCACCGG CTGGGATATC GACATCCTGA CCGAGGCCGA AGAGAGCGAA CGCCGTCAGG CCGAATTCCT CAGCCGCACG GCAACCTTCG CCGAGGCGCT TGATGTCGAC GAGATGATCG CGCAGCTCCT CGCCTCCGAA GGCTTCGCCT CGATCGAGGA AGTGGCTTAT GTCGACCTCG ATGAAATCGC CGAAATCGAA GGCTTCGACG AAGACACCGC TCAGGAAATC CAGAGCCGCG CGCTCGAATA TATCGAACGC CAGAACGCGG AGTTCGATGC GAAGCGCCGT GAGCTCGGCG TCGCCGATGA AGTGGCCGAC GTGCCGGGCG TCACGCCGAA GATGATGGTT GCCTTCGGCG AGAACGACGT GAAGACGGTC GAGGACCTTG CCGGCTGTGC CACCGACGAT CTCATCGGCT GGAACGAGAC GGTGAACGGC GAGCGCAAGC ATCAGCAGGG TATCATCGAG GGCTTCGATC TGACCGCCGA GGAGGCGAAC GACCTCATCA TGCAGGCGCG CCTCAAGGCC GGCTGGATCA CCGAGGCCGA TCTCGCCTCC GACGAAGAGG AAGAGGCCGG TGAAACGGAC GAGGTTGGTG AAGAAGCCTG A
|
Protein sequence | MAVSANRLEL LQIADAVARE KSIDREVVIE AMAEAIQKAA RSRYGAENEI RARINPQTGE IRLVRLLEVV DTVENDAVQI DLKSAQRRNP AAQLGDLIED ELPPVDFGRI AAQTAKQVIV QKVRDAERER QYDEYKDRIG EIVNGIVKRV EYGNVIVDLG RGEAIVRRDE LLPRETFRNG DRVRAYIYDV RREQRGPQIF LSRSHPQFMA KLFAQEVPEI YDGIIEIKAV ARDPGSRAKI AVLSNDSSID PVGACVGMRG SRVQAVVNEL QGEKIDIIQW SPDAATFIVN GLAPAEVVKV VLDEDAQRIE VVVPDDQLSL AIGRRGQNVR LASQLTGWDI DILTEAEESE RRQAEFLSRT ATFAEALDVD EMIAQLLASE GFASIEEVAY VDLDEIAEIE GFDEDTAQEI QSRALEYIER QNAEFDAKRR ELGVADEVAD VPGVTPKMMV AFGENDVKTV EDLAGCATDD LIGWNETVNG ERKHQQGIIE GFDLTAEEAN DLIMQARLKA GWITEADLAS DEEEEAGETD EVGEEA
|
| |