Gene Caul_5019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5019 
Symbolrho 
ID5902481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5421889 
End bp5423310 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content64% 
IMG OID641565540 
Producttranscription termination factor Rho 
Protein accessionYP_001686637 
Protein GI167648974 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000339489 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC AAACAGAGAA CCAGACCGAC AGCGCCAACG AGGCCGAAGA GCCGATCGTC 
GATACGACGA CCCTGGCCGC CTCGGTCGAT CCGCAAGGCG ACGACAATGG CGGCGACGAC
GAATCCGAAG TGGGCGCCAC CGTAGCGGCC ATGGGCCTGA AGACGATGTC GCTGCAGGAG
CTGAAGGAGA AATCCCCGGC CGACCTGCTG GCCTTCGCCG AGACCTTCGA GGTCGAGAAC
GCCAACTCCA TGCGCAAGCA GGACATGATG TTCGCGATCC TCAAGACCCT CGCCGAAGAA
GGCGTGGAAA TCTCGGGCTC GGGGACCATG GAAGTGGTGC AGGACGGCTT TGGCTTCCTG
CGCTCGCCGG AAGCCAACTA TCTTCCGGGT CCGGATGATA TCTACGTGTC GCCCTCGCAA
ATCCGCAAGT TCGGCCTGCG CACCGGCGAC ACCATCGACG GCGCCATCCG CGCGCCCCGC
GAGGGCGAGC GCTACTTCGC CCTCACCGGC GTGACCTTGA TCAATTTCGA GAGCCCGGAC
AACGTCAAGC ACAAGGTCCA CTTCGACAAC CTGACCCCGC TCTATCCCGA GGAGCGGCTG
AACATGGAAC TGCCCGATCC GACCATCAAG GATCGCTCGG GCCGGGTCAT CGACATCGTC
GCCCCGCTGG GCAAGGGTCA GCGCTGCCTG ATCGTCGCCC CGCCGCGCGT CGGCAAGACG
GTGATGCTGC AGAACATCGC CAAGTCGATC GAGACCAACC ACCCCGAGTG CTACCTGATC
GTCCTGTTGA TCGACGAGCG CCCGGAAGAA GTCACCGACA TGCAACGCAC GGTAAAGGGC
GAGGTCATCG CCTCGACCTT CGACGAACCG GCGACCCGCC ACGTGCAGGT GGCCGAGATG
GTCATCGAAA AGGCCAAGCG CCTGGTCGAG CACAAGCGCG ACGTGGTCAT CCTGCTGGAC
TCGGTCACCC GCCTGGGCCG CGCCTACAAC ACCACCGTCC CGTCGTCGGG CAAGGTGCTG
ACCGGCGGCG TCGACGCCAA CGCCTTGCAG CGCCCCAAGC GCTTCTTCGG CGCGGCGCGG
AACGTCGAGG AGGGCGGCTC GCTGTCGATC ATCGCCACCG CCCTGATCGA CACCGGCAGC
CGGATGGACG AAGTGATCTT CGAAGAGTTC AAGGGCACCG GTAACTCGGA AATCGTTCTT
GATCGTAAGG TGGCGGACAA GCGCATCTTC CCGGCCATCG ACGTGTTGAA GTCGGGCACC
CGCAAGGAAG AGCTGATCAC GCCGCGGGAC CAATTGCAGA AGACCTACGT TCTGCGCCGG
ATCCTCAACC CGATGGGCGC CTCGGACGCC ATCGAGTTCC TGCTCGAGAA GATGCGCCAG
TCAAAGACCA ACGGCGATTT CTTCCAGTCG ATGAACACCT AG
 
Protein sequence
MTDQTENQTD SANEAEEPIV DTTTLAASVD PQGDDNGGDD ESEVGATVAA MGLKTMSLQE 
LKEKSPADLL AFAETFEVEN ANSMRKQDMM FAILKTLAEE GVEISGSGTM EVVQDGFGFL
RSPEANYLPG PDDIYVSPSQ IRKFGLRTGD TIDGAIRAPR EGERYFALTG VTLINFESPD
NVKHKVHFDN LTPLYPEERL NMELPDPTIK DRSGRVIDIV APLGKGQRCL IVAPPRVGKT
VMLQNIAKSI ETNHPECYLI VLLIDERPEE VTDMQRTVKG EVIASTFDEP ATRHVQVAEM
VIEKAKRLVE HKRDVVILLD SVTRLGRAYN TTVPSSGKVL TGGVDANALQ RPKRFFGAAR
NVEEGGSLSI IATALIDTGS RMDEVIFEEF KGTGNSEIVL DRKVADKRIF PAIDVLKSGT
RKEELITPRD QLQKTYVLRR ILNPMGASDA IEFLLEKMRQ SKTNGDFFQS MNT