Gene Arth_2620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2620 
Symbol 
ID4444861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2935375 
End bp2937564 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content68% 
IMG OID639690439 
Producttranscription termination factor Rho 
Protein accessionYP_832099 
Protein GI116671166 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGAAA CCACTGAGCT GTCTCCAGCT GTGGACACAT CTTCTGCTAC CGAGTTGTCG 
GCAGCACCCG CAAAGAGCAG CGGCCTTGCC GGCCTTAAGC TCGCCCAGCT GCAGGCCCTC
GCCAGCCAGT TGGGTATCTC TGGCGGATCC CGCATGCGCA AGGGCGATCT GGTGACCGCC
ATTTCCGCCC ATCGTGCGGG TACTCCAACC ACAAAGGCTC CTGCCAAGGG TGCCGAGAAG
ACCGGCGAGA CTATCTCGGC CCGGGCCACG GCTCCGGCTG CAGCTGCACC TGCAGCGGGC
GCCCCTGCGG CCGGAACCGC AGCGGAAGCC GCCGAGGCTC CCGCCCAGGA AGGCACTCGT
GCCCGGCGCG GCCGCAGCCG CCGCGCAGGC AGCGACGGCG TGATCACCCC TCCCGCCACC
GAGGCTCCGG AAACTGCTCC CAGCGAGGCC CCCGCGGCCG CGTCCGCTCC TGACGCCGGC
CAGGCCGCCG TGTCCGAAGG TGCCGCCCCG GAAGCTGCAG AGCGCCGCCA GCCGCGCACC
CGCAACCGCC GCCGCGGTGA AGCGGCAGCC CAGGCGACTG CACCGGCCGC CGAAAGCGAA
ACACAGGCCG AGGCTCCTGC CGAACAGCGT GTTGCTGAAC AGCGCACCGA ACAGCGCACT
GAACAGCGTG CCGGGGAACA GCGCGAACAG CGCGGTGCAG ACCAGGGCGG CGAAGCAGCC
GACGCCGGTC AGCGCAACGA ACGCCAGTCA ACCCGCACCC GCGGCGGTCG CGATGGTGGA
GACACCTCCG GTGCGCCCCG CCGCGATGAC ACCCGCGACA ACACCCGCGA CAGCGACGAT
TCGGACGGCG GAAGCCGCCG CAACCGCCGC AACCGCCGTG ACCGCAACGA CCGCAACGAC
CGCTCAGGCG GCCAGGGCCA GGACAACCGG GACAACTCCC GCAACGACCG GTTCCGCGAC
CGTAACGACC GCCGTCGTGG ACGCGCCCAG GGACCGGACG TCGACGACGT CGAGGTCACC
GAGGACGATG TCCTGCTGCC CGTCGCCGGC ATCCTGGACG TGCTGGAGAA CTACGCGTTC
ATCCGCACCT CCGGTTACCT GCCGGGTCCG AACGACGTCT ACGTGTCCCT CGCCCAGGTC
AAGAAGTACA ACCTGCGCAA GGGCGACGCC GTGGTCGGCG CCATCCGCGC ACCGCGTGAA
GGCGAAGACC GCAGCCAGCA GTCTGCCCGC CAGAAGTTCA ACGCCCTGGT CCGCGTCACC
TCCGTCAACG GCAAGACCCC CGAGGAGCTC AAGGACCGCG TCGAATTCGC GAAGCTCGTC
CCGCTGTACC CCTCGGAGCG CCTGCGCCTC GAGACGGACC CCAAGAAGAT CGGCCCCCGC
GTCATCGACC TGGTTGCCCC GATCGGCAAG GGCCAGCGCG GCCTGATCGT TTCGCCGCCG
AAGGCCGGCA AGACGCTCAT CCTGCAGTCC ATCGCCAACG CCATCACCAC CAACAATCCT
GAGGTCCACC TCATGATGGT GCTCGTTGAC GAACGCCCTG AAGAAGTCAC GGACATGCAG
CGCACCGTCA AGGGCGAGGT CATTGCCTCC ACCTTCGACC GTCCCGCCGA CGACCACACC
ACCGTGGCCG AGCTCTCCAT CGAACGCGCC AAGCGCCTCG TGGAAATGGG CATGGACGTT
GTGGTTCTCC TCGACTCGAT GACCCGACTG GGCCGTGCCT ACAACCTGGC GGCGCCGGCT
TCCGGCCGTA TCCTCTCAGG TGGCGTCGAC TCGGCAGCAC TGTACCCGCC GAAGCGCTTC
TTCGGTGCCG CCCGCAACAT CGAAAACGGC GGCTCGCTCA CCATCCTGGC CACCGCGCTC
GTCGAGACCG GTTCCAAGAT GGACGAGGTC ATTTTCGAAG AGTTCAAGGG AACCGGCAAC
ATGGAACTGC GCCTGTCCCG CCAGCTCGCG GACAAGCGCA TCTTCCCGGC CGTGGACGTC
AACGCGTCCG GTACCCGCCG CGAAGAGAAC CTGCTTTCCC CCGAGGAAGT CAAGATCATG
TGGAAGCTGC GCCGCGTCCT CTCCGGACTC GAAACGCAGC AGAGCCTTGA GCTGCTGACC
AACAAGATCC GGGAGACGCA GAGCAACGTC GAGTTCCTCA TGCAGGTCCA GAAGACGACG
CTTGGTGCGA AGTCGGATAA CGACAAGTAG
 
Protein sequence
MTETTELSPA VDTSSATELS AAPAKSSGLA GLKLAQLQAL ASQLGISGGS RMRKGDLVTA 
ISAHRAGTPT TKAPAKGAEK TGETISARAT APAAAAPAAG APAAGTAAEA AEAPAQEGTR
ARRGRSRRAG SDGVITPPAT EAPETAPSEA PAAASAPDAG QAAVSEGAAP EAAERRQPRT
RNRRRGEAAA QATAPAAESE TQAEAPAEQR VAEQRTEQRT EQRAGEQREQ RGADQGGEAA
DAGQRNERQS TRTRGGRDGG DTSGAPRRDD TRDNTRDSDD SDGGSRRNRR NRRDRNDRND
RSGGQGQDNR DNSRNDRFRD RNDRRRGRAQ GPDVDDVEVT EDDVLLPVAG ILDVLENYAF
IRTSGYLPGP NDVYVSLAQV KKYNLRKGDA VVGAIRAPRE GEDRSQQSAR QKFNALVRVT
SVNGKTPEEL KDRVEFAKLV PLYPSERLRL ETDPKKIGPR VIDLVAPIGK GQRGLIVSPP
KAGKTLILQS IANAITTNNP EVHLMMVLVD ERPEEVTDMQ RTVKGEVIAS TFDRPADDHT
TVAELSIERA KRLVEMGMDV VVLLDSMTRL GRAYNLAAPA SGRILSGGVD SAALYPPKRF
FGAARNIENG GSLTILATAL VETGSKMDEV IFEEFKGTGN MELRLSRQLA DKRIFPAVDV
NASGTRREEN LLSPEEVKIM WKLRRVLSGL ETQQSLELLT NKIRETQSNV EFLMQVQKTT
LGAKSDNDK