Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3785 |
Symbol | |
ID | 5901247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4104732 |
End bp | 4105829 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641564308 |
Product | oxidoreductase domain-containing protein |
Protein accession | YP_001685410 |
Protein GI | 167647747 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000208707 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACATCG GACGCCGCGA CCTCCTTCTC GCCGCCGCTT CTCTCGCCGT CGCCTCAGCC GCGGGAGGGG CCCGCGCCGC CAGCGACCGC AAGATCGGCT ACGCGATCGT CGGCCTGGGC TATTACGGCC TCAACGTCAT CCTGCCGCAG TTCGTCAACT GCGAGCACAG CCGGGTCACG GCCCTGGTCA GCGGCGACCC GGCCAAGGCT CGCGCGACCG CGGCGCGCTA CGGAGTGCCG GAGCGCTCGA TCTATTCCTA CGAGACCTTC GACCAGATCC GCGACAATCC CGACGTCGAT GTCGTCTATG TCATCCTGCC CAATTCCATG CACGCCGAGT ACACGATCCG CGCCGCCAAG GCCGGCAAGC ATGTGATGTG CGAGAAGCCG ATGGCGACGT CGGTCGCCGA GTGCGAGGCG ATGATCGCCG CCTGCAAGAC GGCTGGGCGC AAGCTGATGA TCGGCTATCG CTGCCATTTC GAGGCCACCA ATCTCGAAGC CGTGCGCCTG GCCCGCGCGG GCGCGGCCGG CCACGTCCGC TATGTGCGCT CCGAGCACGG CTTCGTGCAG GGCGACCCGT CGAAGTGGCG GTTGAAGAAG GCGATGGCGG GCGGCGGGTC GCTGATGGAC ATGGGCGTCT ACAGCCTGCA GGCGGCGCGA TACATGACCG GCGAGGAGCC CGTCTCGGTC ACGGCGCGCG AGTCCACCGA TCGCGGCGAT CCCCGGTTCA CCGAGGTGGA GGACATGATC GAATGGCTGC TGAAGTTTCC GTCCGGGGCG ATCGCCAGCT GCCTGTCGAC CTACAGCGCA AATCAGAACC ACGTCCTGCT GATGGGCGAC AAGGGACGGA TCGAGATGGA GCCCGCCACC CGCTATGACG GCAATCGCCT GTGGACCGGC AGGGACGGCC GCGCGGACGA AATCGCGCCG CCGCCTGGCC CGGCCAAGAC CCAGTTCGCC GGCCAACTGG ATCACCTGGC CGATTGCATT CGAACCAACC GCACGCCGAT CGTTTCGGGA GAAGAGGGCC TGCGCGACAT GCGGATCATC GAGGCGATCT ACCGGTCGGC GCGGGAGGAA AGCACGATCA AGCTGTAG
|
Protein sequence | MDIGRRDLLL AAASLAVASA AGGARAASDR KIGYAIVGLG YYGLNVILPQ FVNCEHSRVT ALVSGDPAKA RATAARYGVP ERSIYSYETF DQIRDNPDVD VVYVILPNSM HAEYTIRAAK AGKHVMCEKP MATSVAECEA MIAACKTAGR KLMIGYRCHF EATNLEAVRL ARAGAAGHVR YVRSEHGFVQ GDPSKWRLKK AMAGGGSLMD MGVYSLQAAR YMTGEEPVSV TARESTDRGD PRFTEVEDMI EWLLKFPSGA IASCLSTYSA NQNHVLLMGD KGRIEMEPAT RYDGNRLWTG RDGRADEIAP PPGPAKTQFA GQLDHLADCI RTNRTPIVSG EEGLRDMRII EAIYRSAREE STIKL
|
| |