Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4202 |
Symbol | |
ID | 5901664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4565729 |
End bp | 4567168 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641564724 |
Product | hypothetical protein |
Protein accession | YP_001685824 |
Protein GI | 167648161 |
COG category | [R] General function prediction only |
COG ID | [COG2270] Permeases of the major facilitator superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.246317 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAGT TCTCGGCCGC CGCGGCGGTT GCGGCGGAGG GTGAACCGAT CGCCCCGCTG AAGTCCGGCC TGTCGCGCGG CGCGCTCGCC TGGATCCTGC AGCAGGGCGC GCGCGACCCC TATGTGATCC TGATCACCAT CTACATCTTC TCGCCCTACT TCTCGCGGGT GCTGGTCGGC GATCCGGTCA AGGGTCAAGC CGTGGTGGCC AATCTCTCGA CGATCTACGG GGTGCTGACC GCCCTGACCG CCCCGCTGCT GGGGGCGATG ATCGAGCAGT ACGGACCGCG CAAGCCGATG CTGGGCCTGG TGCTGGGAGT GATGGTCCCG GCGCTGGCGG CCCTGTGGTG GGCCATGCCG GTAGGCGGAC TGCCGCTGAT GGTGACCAGC GCCGCGCTGA TCGTGCTGGG CCTGGTCTAT AATTGGGGCG ACGTGCTGAG CAATTCGCTG CTGGGCCGGG CCGCTGGTCC CGTTCCAGGT CGCGCCGCCC TGGTCTCGGG CCTGGGCTAC GCGGTGGCCA ACGGCCTGTC GGTGGCGCTG CTGGTGTTCA TGCTCTGGGG CATGGTGCTG CCGGGCCAGG TTAACTGGCC GGGCGTGCCG CACGCGCCGC TGTTTGGCCT GGACGCCAGC AAGAACGAAC CCAGCCGGAT CTCCGGTCCG ATGGCGGCGG CGGTGATGCT GCTGGGCGCG ATCCCTTTCT TCCTGTGGAC CCCCGACGCC GCGCGCACCG GCCGCAGCTG GATGGCCAGC ATGCGGGCCG GGATCGCCAT GCTGCGCGAC ATCTTCGGCA ATCTGCGGGG CCATCGCGAC GTCGCCCTCT TTCTGGGCGG ACGCATGCTC TACTGCGACG GCATGACCGC GCTGCTGGTG TTCGGCGGGC TGCTGGCGGC GGGCCTGATG CGCTGGGGCG CGCTGGAGAT GCTGGCCTAC GGTATCTGCC TGAGCATCTT CGGCGTGGTC GGCGGCCTCG TCGCGCCGTG GTTCGACCGC ACCCTGGGTC CGCGCAAGGC GGTGCAGCTG GAGATCGCCG CCTCGCTGCT GATCCTGATC GCCACCCTGG GCATGGGGCG CGAGAAGATC CTCTATTTCT GGGCCTACGA TCCGGCCGCC CATGCTCCGG TCTGGAACGG GCCGCTGTTT CGCACGGCGC CGGAGCTGGT CTATCTGGGC CTGGGCCTGC TGATCGCGGT GTTCGTCACC GCGCAGTACG CCTCCAGCCG CACCCTGCTG ATCCGCCTGT GCCCGCCCGA CAAGACGGCG GCGTTCTTCG GCCTCTACGC GCTGTCGGGC ACCGCCACCA TGTGGATCGG CTCGCTGCTG GTCGCCCTAG CCACCGCCAT CTTCAAGAGT CAGATCGGCG GCTTCCTGCC GGTCGCGGCC CTGCTGCTGC TGGGCTTCTG CGTGCTGTTC TGGGTCAAGG GCGGCGAGCG CGAGGGCTGA
|
Protein sequence | MSEFSAAAAV AAEGEPIAPL KSGLSRGALA WILQQGARDP YVILITIYIF SPYFSRVLVG DPVKGQAVVA NLSTIYGVLT ALTAPLLGAM IEQYGPRKPM LGLVLGVMVP ALAALWWAMP VGGLPLMVTS AALIVLGLVY NWGDVLSNSL LGRAAGPVPG RAALVSGLGY AVANGLSVAL LVFMLWGMVL PGQVNWPGVP HAPLFGLDAS KNEPSRISGP MAAAVMLLGA IPFFLWTPDA ARTGRSWMAS MRAGIAMLRD IFGNLRGHRD VALFLGGRML YCDGMTALLV FGGLLAAGLM RWGALEMLAY GICLSIFGVV GGLVAPWFDR TLGPRKAVQL EIAASLLILI ATLGMGREKI LYFWAYDPAA HAPVWNGPLF RTAPELVYLG LGLLIAVFVT AQYASSRTLL IRLCPPDKTA AFFGLYALSG TATMWIGSLL VALATAIFKS QIGGFLPVAA LLLLGFCVLF WVKGGEREG
|
| |