Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4809 |
Symbol | |
ID | 5902271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5203721 |
End bp | 5204830 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641565329 |
Product | hypothetical protein |
Protein accession | YP_001686427 |
Protein GI | 167648764 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00137373 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCCA GCGAACAACG GACCATTTTC CGCCTCTTCG CCCTGGGCGC CGCGATCATG GTCGCGACCC TGGCGGCGCT GCTCGTCCTG CCTCACTCCC GCTATGTGCT TTGGCAAGAC CTGAGGACCG AGGCCTATAC CCGGTTCGGC TGGATCTACG AGCGGGTTCA CTTCGACCAG ACGCCGATCG ACATCGCCTT CGTCGGCACC TCTCACACCA TGAACGGGGT CGACGCCGCC GCCGTGGCGC GGGCGCTGGC GGCTGGCGGG GCCCAGGTCG ACGGCGGGCG CTGCCCGACC GCCACCAACT TCGCCATGCC GTCCTACGGC CGCAACCTGC ACTGGCTGAT CGCCCGCGAA GTGCTCGAGA ACCGGCCGGT CAAGGTGCTG GTGCTGGAGG TCTTCGAGAA CGAGACCCGC AAGGCCCACC CCGTATTTTC GCACGTGGCC GACGCGAAGG ACATTCTCGG CGCGCCCCTG CTGATCAACC TCAACTATGC GCATGACATC GTCCGGCTGC CGTTCCGCCA GGCGTCCCTG GCCGTCGAGA GCCTGGCGCC GGCGCAGTTT GGCCTGAAGT CGCGCTTCGA TCCGGCCGAC TACGACGGCT CGACCGTCGA CAACACCCGG GTCGTCAACG CCGACGGCGT GGCGCTGACG CCGCCGCGGA CCGAGGTCTT CGATCCGGCG AAGCTGGACG CCACGGCTCG CGCCGAGGCG GGCTCCAAGA ACCTGAACAT GATGGGCAAG CGTTTCGAAG CGCTGGAGTA CGCCTATCCC CGCTATTACG TGAACCAGAT CCTGGACCTG GCCAAGGCCA AGGGCGTCAA GGTCGTGTTC CTGTACCTGC CGGGCTACGG CAAGCCGCCG CAGCCCTACG ACATGAGCCT CTATGCCGGC CGAGGCCCGA TGATCTCGGC AAACGACCTT CTGGCCCGCA AGGACTACTG GTTCGACGCC GCACACCTCA ATGCCAATGG CGCTCAGGCC CTGTCTGGCC GCCTGGCCCC GCTGCTGGCC AATCAGTTCG CCGGTGGCGG CGCGGCCAAC CGGGTCTGCG ACTTCGGCTA TGCGCCGCGC AAGACACTGA AGCCCTTCAC CCACCCGTAG
|
Protein sequence | MTPSEQRTIF RLFALGAAIM VATLAALLVL PHSRYVLWQD LRTEAYTRFG WIYERVHFDQ TPIDIAFVGT SHTMNGVDAA AVARALAAGG AQVDGGRCPT ATNFAMPSYG RNLHWLIARE VLENRPVKVL VLEVFENETR KAHPVFSHVA DAKDILGAPL LINLNYAHDI VRLPFRQASL AVESLAPAQF GLKSRFDPAD YDGSTVDNTR VVNADGVALT PPRTEVFDPA KLDATARAEA GSKNLNMMGK RFEALEYAYP RYYVNQILDL AKAKGVKVVF LYLPGYGKPP QPYDMSLYAG RGPMISANDL LARKDYWFDA AHLNANGAQA LSGRLAPLLA NQFAGGGAAN RVCDFGYAPR KTLKPFTHP
|
| |