Gene Bind_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1049 
Symbol 
ID6198846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1204755 
End bp1206371 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content60% 
IMG OID641705042 
ProductNusA antitermination factor 
Protein accessionYP_001832181 
Protein GI182678035 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA
[TIGR01954] transcription termination factor NusA, C-terminal duplication 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTGA GCGCCAATAG GCTCGAGCTT CTGCAAATCG CCGATGCCGT CGCGCGAGAG 
AAATCGATCG ACCGGCAGAT CGTCCTGGCC TCCATGGAGG ACGCGATCCA GAAAGCTGCG
CGCTCGCGCT ACGGTCAGGA GACCGAAGTG CGCGCTGAGA TCAATCCGAA GACCGGAGAA
ATCCGCTTCT CGCGATTGCT GCATGTCGTC GATGAGATCG ATAATGACGC GGTGCAGATC
ACCCTCGCGG AGGCCCGCAA GAAGAATCCC GCCGCCGAAC TTGGCGACTG GATCGCCGAA
ACCCTGCCGC CCTTCGACTT CGGCCGCATC GCCGCCCAAT CGGCGAAGCA GGTGATCGTT
CAGAAGGTGC GCGAGGCCGA GCGCGATCGG CAATATCAGG AATATAAGGA CCGCATTGGC
GATATCGTCA ACGGCGTCGT CAAGCGCGTC GAATATGGCA ATGTCATCAT TGATCTCGGG
CGTGGCGAGG CGACCATCCG GCGCGACGAG ATGATCCCGC GCGAAGTGTT CCGCCCCGGC
GATCGCGTGC GTGCCTATGT CTATGACGTG CGGCGCGAGC AGCGCGGACC CCAGATTTTC
CTCTCCCGCA CGCATCCCCA ATTCATGGCC AAGCTGTTCC GTCAGGAAGT GCCTGAAATC
TACGACGGCG TGATCGAGGT GAAGGCCGTG GCGCGCGATC CAGGCTCGCG CGCCAAGATC
GCCGTCATCT CGCGCGATAC GTCGATCGAT CCGGTCGGTG CCTGCGTCGG CATGCGCGGC
TCGCGCGTTC AGGCGGTCGT CAATGAATTG CAGGGCGAGA AGATCGACAT CATTCCCTGG
TCGCCCGATG CCGCCACCTT CATCGTCAAT GCCTTGCAGC CGGCGGAAGT CGTCAAGGTC
GTGCTCGACG AGGATTCAGC GCGTATTGAA GTCGTGGTTC CGGATGACCA ATTATCCTTA
GCAATCGGCC GTCGTGGCCA GAATGTCCGC CTTGCCTCGC AATTGACCGG CTGGGATATC
GACATTTTGA CCGAGGCTGA GGAATCGGCC CGCCGCCAGA AGGAATTCAC CGAGCGTACC
GCCATGTTCA TGGGTGCGCT CGACGTTGAT GAAGTGGTCG GCCAATTGCT TGCCTCGGAA
GGCTTCCGTT CGGTCGAGGA ACTTGCTTTC GTCGAACCTT CCGAACTTGC GGTGATCGAA
GGTTTCGACG AGGAAACGGC GGCTGAGATT CAGGCCCGCG CCAATGCCTA CCTCGCCCGC
ATCGAGGCTG AACATGAAAC GCGCCGGCGC GAGCTCGGTG TTTCCGACGA TCTGCTCGAG
ATCGACGGTT TGACCAATGC CATGCTCGTG AAATTTGGTG AAAACGATAT CAAGACCGTC
GAGGATCTCG CCGGCTGCGC CACCGACGAT CTCGTCGGCT GGAGCGAGCG CAAGGATGGC
GAAACCACAC GCCATCCAGG CATTCTTGAC GGATTTGAAG TGTCGCGCGA GGAAGCCGAG
GGTCTTATTA TGAAGGCGCG GGTGAAAGCC GGCTGGATCG ATGCCCTGCC CGAAGCCTCC
GAACCCGAAC AAGAGACTTT CGCCGAAGCC GAGACGCAAA GCGAAAGCGC TGACTGA
 
Protein sequence
MAVSANRLEL LQIADAVARE KSIDRQIVLA SMEDAIQKAA RSRYGQETEV RAEINPKTGE 
IRFSRLLHVV DEIDNDAVQI TLAEARKKNP AAELGDWIAE TLPPFDFGRI AAQSAKQVIV
QKVREAERDR QYQEYKDRIG DIVNGVVKRV EYGNVIIDLG RGEATIRRDE MIPREVFRPG
DRVRAYVYDV RREQRGPQIF LSRTHPQFMA KLFRQEVPEI YDGVIEVKAV ARDPGSRAKI
AVISRDTSID PVGACVGMRG SRVQAVVNEL QGEKIDIIPW SPDAATFIVN ALQPAEVVKV
VLDEDSARIE VVVPDDQLSL AIGRRGQNVR LASQLTGWDI DILTEAEESA RRQKEFTERT
AMFMGALDVD EVVGQLLASE GFRSVEELAF VEPSELAVIE GFDEETAAEI QARANAYLAR
IEAEHETRRR ELGVSDDLLE IDGLTNAMLV KFGENDIKTV EDLAGCATDD LVGWSERKDG
ETTRHPGILD GFEVSREEAE GLIMKARVKA GWIDALPEAS EPEQETFAEA ETQSESAD