Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avi_0166 |
Symbol | nusA |
ID | 7388330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 155168 |
End bp | 156784 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643649886 |
Product | transcription elongation factor NusA |
Protein accession | YP_002548104 |
Protein GI | 222147147 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.590136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTCA GTGCTAATCG GCTCGAGCTG TTGCAGATCG CGGATGCGGT CGCACGCGAA AAAGTCATCG ACCGCGAAAT CGTGCTTGCT GCGATGGCTG ACGCCATCCA GAAGGCCGCG CGTTCGCGTT ACGGTTCCGA GACCAATATT CGCGCCGACA TCAACTCCAA GACCGGCGAG ATTCGTCTGC AGCGCCTGCT CGAAGTGGTC GAAACCGTCG AGGATTACGG CACCCAGATC GCACTCGAAC TGGCGCGCGA CCGCAATGTT GACGCCAAGC TCGGTGATTA CATTGCCGAT CCGCTGCCGC CGATGGATTT CGGTCGGATC GCCGCCCAGT CCGCCAAGCA GGTTATCGTG CAGAAAGTGC GCGAAGCCGA GCGTGATCGG CAGTTCGATG AGTTCAAGGA TCGCGTCGGC GAAATCATCA ACGGCACGGT CAAGCGCGTC GAATATGGCA ATGTCATCGT CGATCTCGGT CGTGGCGAAG GCATTATCCG CCGCGATGAA ATGATCCCGC GCGAAACCAT GCGTTATGGT GACCGCGTTC GCGCCTATGT CTATGACGTG CGCCGCGAAC AGCGTGGCCC GCAGATTTTC CTGTCGCGCA CCCATCCGCA GTTCATGGTG AAACTGTTCA CCATGGAAGT GCCGGAAATT TACGACGGCG TCATCCAGAT CAAGTCGGTG GCCCGTGACC CGGGTTCTCG CGCCAAGATC GCAGTGATTT CCAACGACAG CTCGATCGAT CCGGTCGGTG CTTGCGTCGG TATGCGCGGT TCGCGTGTTC AGGCCGTGGT TGGTGAATTG CAGGGCGAAA AGATCGATAT CATTCCGTGG TCGGCAGACC CGGCATCCTT CATCGTCAAC GCGCTTCAAC CGGCAGAAGT TGCCAAAGTG GTGCTGGACG AAGATGCCGA GCGCATCGAA GTGGTTGTTC CCGATGAGCA GCTGTCGCTG GCCATCGGCC GTCGCGGCCA GAATGTCCGC CTGGCGTCGC AGCTGACCGG CTGGGATATC GACATCATGA CGGAAGCGGA AGAATCCGAG CGCCGCCAGA AGGAATTCAA CGAGCGCACC AACCTGTTCA TGGATGCGCT CGACGTCGAT GAAATGGTTG GCCAGGTTCT GGCGTCGGAA GGCTTTGCCC AGGTTGAAGA GCTTGCCTAT GTCGATCTGG AGGAAATCGC TTCCATCGAT GGTTTTGACG GCGATACCGC AGAAGAAATC CAGACCCGCG CCCGCGAATA TCTCGAAAAG CTGGAAGCCG AACTTGATGC CAAGCGCAAG GCGCTCGGCG TGTCCGACGA GCTGCGCAGC ATTGATGGCA TGACAACGCA AATGTTGGTT GCGCTCGGGG GAGACGGCAT CAAGACGGTC GAGGATTTCG CCGGCTGCGC TGCTGACGAC CTGATCGGCT GGAGTGAGCG TAAGGATGGC GAGACCAAGA AATTCGAGGG CCTGTTCTCG AAAATCGACA TTTCTCGCAC CGAAGCGGAA CAGATGATCG TTCAGGCTCG CCTGGCGGCT GGCTGGATCA CCGAAGCGGA TATTGCCGCC GAGGCTGAGG CAGAAGTCGT CGAGGATGAG GCGCAAGAGG CTGAGCAGGG CTCGTGA
|
Protein sequence | MAVSANRLEL LQIADAVARE KVIDREIVLA AMADAIQKAA RSRYGSETNI RADINSKTGE IRLQRLLEVV ETVEDYGTQI ALELARDRNV DAKLGDYIAD PLPPMDFGRI AAQSAKQVIV QKVREAERDR QFDEFKDRVG EIINGTVKRV EYGNVIVDLG RGEGIIRRDE MIPRETMRYG DRVRAYVYDV RREQRGPQIF LSRTHPQFMV KLFTMEVPEI YDGVIQIKSV ARDPGSRAKI AVISNDSSID PVGACVGMRG SRVQAVVGEL QGEKIDIIPW SADPASFIVN ALQPAEVAKV VLDEDAERIE VVVPDEQLSL AIGRRGQNVR LASQLTGWDI DIMTEAEESE RRQKEFNERT NLFMDALDVD EMVGQVLASE GFAQVEELAY VDLEEIASID GFDGDTAEEI QTRAREYLEK LEAELDAKRK ALGVSDELRS IDGMTTQMLV ALGGDGIKTV EDFAGCAADD LIGWSERKDG ETKKFEGLFS KIDISRTEAE QMIVQARLAA GWITEADIAA EAEAEVVEDE AQEAEQGS
|
| |