Gene Plav_3626 details

Gene Information

Locus tag: Plav_3626
Symbol:
ID: 5455837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3881572
End bp: 3883182
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 63%
IMG OID: 640879210
Product: NusA antitermination factor
Protein accession: YP_001414881
Protein GI: 154254057
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
            [TIGR01954] transcription termination factor NusA, C-terminal duplication
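
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon; the helper names `span_length` and `coding_length` are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3881572
END_BP = 3883182
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_length: int, has_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * (protein_length + (1 if has_stop else 0))


if __name__ == "__main__":
    # Both values should agree with the reported gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_length(PROTEIN_LENGTH_AA) == GENE_LENGTH_BP)
```

Under these assumptions both checks agree with the record: 3883182 - 3881572 + 1 = 1611, and 3 x (536 + 1) = 1611.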


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.475474
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTAA GCGCCAACCG GCTCGAACTG TTGCAGATCG CGGATGCGGT CGCGCGCGAA 
AAGTCGATCG ACCGCGAAGT CGTCATCGAG GCGATGGCGG AAGCGATCCA GAAGGCCGCG
CGTTCCCGCT ACGGCGCGGA AAATGAAATC CGTGCACGGA TCAATCCGCA GACCGGTGAA
ATTCGCCTCG TGCGCCTCCT CGAAGTGGTG GACACAGTCG AAAACGACGC CGTCCAGATC
GACCTGAAAT CCGCGCAGCG CCGCAATCCG GCCGCGCAGC TCGGCGATCT CATCGAGGAC
GAGCTGCCGC CGGTCGATTT CGGCCGCATC GCCGCGCAGA CCGCCAAGCA GGTCATCGTG
CAGAAGGTGC GCGACGCCGA GCGCGAGCGC CAGTACGACG AATACAAGGA TCGCATCGGC
GAGATCGTCA ACGGCATCGT CAAGCGCGTC GAATACGGCA ACGTCATCGT CGATCTCGGT
CGCGGCGAAG CCATTGTCCG CCGCGACGAG CTGCTTCCGC GCGAAACCTT CCGCAATGGC
GATCGCGTCC GCGCCTATAT CTATGACGTG CGCCGCGAAC AGCGCGGCCC GCAGATTTTC
CTCTCGCGCT CGCATCCCCA GTTCATGGCG AAGCTCTTCG CGCAGGAAGT GCCGGAAATC
TATGATGGCA TCATCGAGAT CAAGGCCGTC GCCCGCGATC CGGGCAGCCG TGCGAAGATC
GCCGTTCTCT CGAACGACTC GTCGATCGAT CCCGTCGGCG CCTGCGTCGG TATGCGCGGC
TCCCGCGTGC AGGCCGTGGT GAACGAATTG CAGGGCGAAA AGATCGACAT CATCCAGTGG
TCGCCCGATG CCGCCACCTT CATCGTCAAT GGCCTCGCGC CGGCCGAAGT CGTCAAGGTC
GTGCTGGACG AAGATGCACA GCGCATCGAA GTCGTCGTGC CGGACGATCA GCTCTCGCTA
GCCATTGGCC GCCGTGGCCA GAACGTGCGC CTCGCCTCGC AGCTCACCGG CTGGGATATC
GACATCCTGA CCGAGGCCGA AGAGAGCGAA CGCCGTCAGG CCGAATTCCT CAGCCGCACG
GCAACCTTCG CCGAGGCGCT TGATGTCGAC GAGATGATCG CGCAGCTCCT CGCCTCCGAA
GGCTTCGCCT CGATCGAGGA AGTGGCTTAT GTCGACCTCG ATGAAATCGC CGAAATCGAA
GGCTTCGACG AAGACACCGC TCAGGAAATC CAGAGCCGCG CGCTCGAATA TATCGAACGC
CAGAACGCGG AGTTCGATGC GAAGCGCCGT GAGCTCGGCG TCGCCGATGA AGTGGCCGAC
GTGCCGGGCG TCACGCCGAA GATGATGGTT GCCTTCGGCG AGAACGACGT GAAGACGGTC
GAGGACCTTG CCGGCTGTGC CACCGACGAT CTCATCGGCT GGAACGAGAC GGTGAACGGC
GAGCGCAAGC ATCAGCAGGG TATCATCGAG GGCTTCGATC TGACCGCCGA GGAGGCGAAC
GACCTCATCA TGCAGGCGCG CCTCAAGGCC GGCTGGATCA CCGAGGCCGA TCTCGCCTCC
GACGAAGAGG AAGAGGCCGG TGAAACGGAC GAGGTTGGTG AAGAAGCCTG A
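
The gene sequence is laid out in space-separated blocks of ten bases, so it needs light cleaning before any computation. The sketch below shows one way to strip that layout and compute a GC fraction comparable to the GC content field; the function names `clean_sequence` and `gc_fraction` are hypothetical, and the exact rounding behind the reported percentage is not known from this record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, as printed in this record.
    first_line = "ATGGCTGTAA GCGCCAACCG GCTCGAACTG TTGCAGATCG CGGATGCGGT CGCGCGCGAA"
    seq = clean_sequence(first_line)
    print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```

To check the whole gene, paste all sequence lines into one string and pass it through `clean_sequence` first.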
 
Protein sequence
MAVSANRLEL LQIADAVARE KSIDREVVIE AMAEAIQKAA RSRYGAENEI RARINPQTGE 
IRLVRLLEVV DTVENDAVQI DLKSAQRRNP AAQLGDLIED ELPPVDFGRI AAQTAKQVIV
QKVRDAERER QYDEYKDRIG EIVNGIVKRV EYGNVIVDLG RGEAIVRRDE LLPRETFRNG
DRVRAYIYDV RREQRGPQIF LSRSHPQFMA KLFAQEVPEI YDGIIEIKAV ARDPGSRAKI
AVLSNDSSID PVGACVGMRG SRVQAVVNEL QGEKIDIIQW SPDAATFIVN GLAPAEVVKV
VLDEDAQRIE VVVPDDQLSL AIGRRGQNVR LASQLTGWDI DILTEAEESE RRQAEFLSRT
ATFAEALDVD EMIAQLLASE GFASIEEVAY VDLDEIAEIE GFDEDTAQEI QSRALEYIER
QNAEFDAKRR ELGVADEVAD VPGVTPKMMV AFGENDVKTV EDLAGCATDD LIGWNETVNG
ERKHQQGIIE GFDLTAEEAN DLIMQARLKA GWITEADLAS DEEEEAGETD EVGEEA
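
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch using the codon-to-amino-acid assignments shared by the standard code and translation table 11; the `translate` function is a hypothetical helper, not the database's own method. Table 11 also permits alternative start codons, which this sketch does not handle specially; that does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA coding sequence; whitespace is ignored.

    Stops at the first stop codon when to_stop is True, otherwise
    writes stop codons as '*'. Raises KeyError on non-ACGT characters.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons and last six codons of the gene sequence in this record.
    print(translate("ATGGCTGTAA GCGCCAACCG G"))  # MAVSANR
    print(translate("GTTGGTGAAG AAGCCTGA"))      # VGEEA
```

Both fragments match the corresponding ends of the protein sequence listed above.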