Gene Avin_47410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_47410 
Symbolrho 
ID7763604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4814265 
End bp4815524 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content61% 
IMG OID643807586 
Producttranscription termination factor Rho 
Protein accessionYP_002801821 
Protein GI226946748 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTGA CCGAACTCAA GCAAAAGCCT ATTACCGAAC TGCTGGAAAT GGCCGAACAG 
ATGGGCCTGG AGAACATGGC CCGTTCGCGC AAGCAGGATG TGATCTTCTC CCTGCTGAAG
AAGCATGCGA AAAGCGGCGA GGAAATCTCC GGTGATGGCG TGCTGGAGAT TCTTCAGGAT
GGTTTCGGAT TCCTGCGCAG CGCGGATTCC TCCTATTTGG CCGGGCCGGA CGACATCTAC
GTCTCGCCGA GCCAGATCCG CCGCTTCAAC CTGCGCACCG GCGACACCAT CGTCGGCAAG
ATCCGCCCGC CGAAGGAGGG CGAGCGCTAC TTCGCGCTGC TCAAGGTCGA CTCGATCAAC
TTCGACCGGC CGGAAAACGC GAAGAACAAG ATCCTCTTCG AAAACCTCAC CCCGCTGTTC
CCCAATGCTC GTCTCACCAT GGAAGCCGGC AACGGCTCCA CCGAAGACCT CACCGGCCGC
GTGATCGACC TCTGCGCGCC GATCGGCAAG GGCCAGCGTG GCCTGATCGT CGCGCCGCCG
AAGGCCGGCA AGACGATCAT GCTGCAGAAC ATCGCCAGCA ACATCACGCG CAACAATCCC
GAGTGTCACC TGATCGTCCT GCTCATCGAC GAGCGTCCGG AAGAGGTGAC CGAGATGCAG
CGCACCGTGC GCGGCGAAGT GGTCGCCTCC ACCTTCGACG AGCCGCCGAC CCGCCACGTG
CAGGTCGCCG AGATGGTCAT CGAGAAGGCC AAGCGCCTGG TCGAACACAA GAAGGACGTG
GTGATTCTGC TCGACTCCAT CACCCGCCTG GCGCGCGCCT ACAACACCGT GATCCCCAGC
TCCGGCAAGG TACTCACCGG CGGTGTCGAC GCCCATGCAC TGGAAAAGCC CAAGCGTTTC
TTCGGTGCCG CGCGCAACAT CGAGGAGGGC GGCAGCCTCA CCATCCTCGC CACTGCGCTG
GTGGAAACCG GCTCGAAGAT GGACGAAGTG ATCTACGAGG AGTTCAAGGG TACGGGCAAC
CTGGAGCTGC AGCTCGACCG GCGCGTCGCC GAGAAGCGTG TGTTCCCGGC GATCAACATC
AATCGCTCCG GTACCCGCCG CGAAGAACTC CTGACCGGTG AAGAGGAACT GCAGCGCATG
TGGATCCTGC GCAAGATCCT GCATCCCATG GACGAGATCG CCGCCATCGA ATTCCTGCTC
GATCGTCTGA AGGATACCAA GACCAACGAG GAATTCTTCC AGTCGATGAA GCGGAAGTAA
 
Protein sequence
MNLTELKQKP ITELLEMAEQ MGLENMARSR KQDVIFSLLK KHAKSGEEIS GDGVLEILQD 
GFGFLRSADS SYLAGPDDIY VSPSQIRRFN LRTGDTIVGK IRPPKEGERY FALLKVDSIN
FDRPENAKNK ILFENLTPLF PNARLTMEAG NGSTEDLTGR VIDLCAPIGK GQRGLIVAPP
KAGKTIMLQN IASNITRNNP ECHLIVLLID ERPEEVTEMQ RTVRGEVVAS TFDEPPTRHV
QVAEMVIEKA KRLVEHKKDV VILLDSITRL ARAYNTVIPS SGKVLTGGVD AHALEKPKRF
FGAARNIEEG GSLTILATAL VETGSKMDEV IYEEFKGTGN LELQLDRRVA EKRVFPAINI
NRSGTRREEL LTGEEELQRM WILRKILHPM DEIAAIEFLL DRLKDTKTNE EFFQSMKRK