Gene Caul_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0074 
Symbol 
ID5897786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp87536 
End bp91186 
Gene Length3651 bp 
Protein Length1216 aa 
Translation table11 
GC content66% 
IMG OID641560557 
Producthypothetical protein 
Protein accessionYP_001681710 
Protein GI167644047 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATGG GAATGGGGCG CGCGCGAATG TTCGGAAAAG TCGCCGCGTT CGAGTTTCGC 
TACCAGACGC GCCAGCCGGT CTTCTGGGTC GGGGTGATCA TCTTCTTCCT GCTCTCGTTC
GCCTCGGTCG CAGCGCCTGA GTTCGTGCAG TTTGGATCGA CGGCCAATGT TCACAAGAAC
GCCGCCTCGG TGATCGCCAG CGGCAACCTG AACCTGGCCC TGATCTACAT GTTCATCACG
GCCGCCTTCG TGGCCAACAT CATCGTGCGC GACGAGGACA CCGGCTTCGA CGGCATCCTG
CGCTCCACCC CGCTGACCAA GTTCGACTAT CTGTACGGCC GGTTCACCGG GGCCTTCGCG
GCGGCGGCGC TGTCGTTCCT GGCCGTGCCC GCCGGCATGG CCCTGGGCGC GGTCATGTGG
TGGGTGGACC CCGAGAGCGT CGGCCCGTTC GTCCTGAACC ACTACCTGTT CGCCTATTTC
GTGCTGGCCC TGCCGTCGCT GCTGTTGACC TCGGCGGTGT TCTTCGCGGC GACCACGGTG
ACACGGTCGA TGATGTGGAC CTATGTCGGC GTCATCGCCT TCATGGTGGC CTCGTTCATC
AGTTCGACCC TGTTGCGCCA GCCCGGCCTG GAGAAGGTCG CCGCGCTCTG GGAGCCCTTC
GGCGGCGGCG CCTATGGCCT CGCGATCCGC TACTGGACAG CGGCCGAGCG CAACACCCAG
ATCCCGCCCC TGGCCGGCTA TCTGCTGGCC AACCGGCTGA TCTGGATCGG CATCGCCCTG
GCGATCATCA CCGCCGCCTA CTGGCTGTTC GACCTGCGCA AGGGCGATGG GGCGACCCGG
GTGAAGCGCG AGAAGCCGGG TCAGGCCGAG ACCCCGCGTC CGACCGGAGT CACGCACGGC
GCGCCCCGCT TTGACGCGGC CTCGCGCTGG GCGCAGCTCT GGGCGCGGGC GCGGCTCGAC
GCGCGGCAGG TGTTCGGCAG CCCGGCCTAT CTGGTGCTGG TCGGCCTGGC CGCCGCCCTG
TCGGTGTTCA ACCTCTGGCG GGTCGCCGAT GACAGCCTCT ATGGGGGGGC GATCTATCCA
GTGACCCGGG CGATGATCCA GGCCTTGAAC GGCATCTTTC CGTTCATTTC GCTGATCATC
GCCTCGTTCT ACGCCGGCGA GCTGGTCTGG CGCGAGCAGG ACCGCAAGAC CCACGAGCTG
ATCGACGTCA CGCCGATCCC CGACTGGGCC TTCGTCGTGC CCAAGACCCT GGCCATCAGT
CTCGTCCTGC TGTCGACCTT CCTGGCCGGC GTGGCCGTCT CGGTGATCAT CCAGCTGATC
AAGGGCTACA CCCACCTGGA GCTGGTGAAC TACATCGTCT GGTGGGTGCT GCCCCAGGCC
GCCGACTGCA TCCTGATGGC TGTGCTGGCG GTGTTTATCC AGGTGCTGTC GCCGCACAAG
TTCGTCGGCT GGGGTCTGAT GGTGCTCTAC ATCATCGCCC TGATCGTCGC CTCCAGCTGG
GGTCTGGAGC ACAATCTGTA TCTCTACGAC GGCTCACCGA TGGTGCCGAT CTCGGACTTC
AACGGGCAGA GCCGGTTCTG GATCGGCGCC TGGTGGTTCC GGCTCTACTG GACGGCCTTC
GCGGTGGTGC TGCTGGTGCT GTGCCACGTG CTGTGGCGGC GCGGAACCGA GAGCCGGTTG
CGCCCGCGAC TGGCCGCGCT GCCGCGCCGC CTGATGGGCG GGCCGGGCGT GATCGCGGGC
GTCGCCCTCG TGGCGTTCAT CGGCGTCGGA GCGTTCATCT ACATCAACAC CAACGTCTGG
AACGCGTACC GCACGACTCT GAGCAACGAG CGCTGGGCCG CCGACTACGA AAAGGCCCTG
CTGCCGTTCG AGACCACGCC GCAGCCGAAG ATCGTCGCGG TCAAACTGGA CGTCGATATC
CGGCCGGGCG CCCCGCGCAT CGACACCAAG GGCGTCTACG AGATCGAGAA CCGCACGGAC
AAGCCGTTGC GCGAGATTCA CGTCCGCTTC GATCGCGACC TCAAGGTCCT GGCCCTGTCG
ATCGAAGGCG CGCGGCCCAA ACGGACCTTC GAGCGGTTCA ACTATCGCAT CTTCGCCTTC
GACACCCCGA TGCAGCCGGG CGAGCGGCGC TCGATGTCGT TCAGCACCGC CCGGGTGGCG
ATGGGCTTCC GCAACAGCGG CGCCGACACC CTGATGGTAG ATAACGGGAC CTTCATCAAC
GACGGCCAGC TGGCCCCGTC TCTGGGCATG GACCGCAACG GCCTGCTGAC CGACCGCAGC
AAGCGGCGCA AGTACGGGCT GAAGCCCGAA CTGCGCCCGG CCAAGCTGGG CGACGTCGCG
TCGCGCCAGT TCAACGGCCT GCGCCGCGAC AGCGACTGGG TGACTTCTGA CATCACCGTC
ACCACCGACG CCGACCAGAC GCCGATGGCG CCGGGCTACA AGGTTTCGGA CACGACGTGG
CATGAATCGG CCAACATGAG CGGCCCGGTC AACAAGGCCG GCGACCGGCG CACCGCGCGG
TTCGTGACCG AAGCCCCGAT CCTGCATTTC TTCTCGATCC AGTCGGCCCG CTACGCGCTG
AGGACCGAGC TCTACAAGGG CGTGCGTCTG TCGGTCTACT ACCACCCGGC CCACGCCTGG
AACGTGGAGC GGATGATCAG CTCGATGAAG CGGTCGCTGG ACTACGACCA GGCCAATTTC
AGCCCCTACC AGTTCCGTCA GTTGCGTTAT CTGGAGTTCC CGGCCTACGG CAATTTCGCC
CAGTCGTTCG CCAACACCAT TCCCTGGTCC GAGAACCTGG GCTTCGTGTC CAAGTACGAG
GACCCGACCA AGATCGACAT GGTCACCTAT GTCGGCGCGC ATGAGATCGC CCACCAATGG
TGGGCGCACC AACTGATCGG CGCCGACCAG CAGGGCGGCG CGGCCTTGGC CGAGACCCTG
GCTCAGTACT CGGCGCTGAT GGTGATGAAG AAGATCTACG GCGAGCCGAT GATCCGCAAA
TTCCTCAAGT ACGAGCTGGA CCGCTATCTG CGGGCTCGGG GCGGCGAGGT GATCGAGGAG
CTGCCGCTGC GCCAGGTCGA AGACCAGCCC TACATCTACT ACAACAAGGG CTCGCTGGTG
ATGTACCGGC TGGCCAGCGA GATTGGCGAG GACAACGTCA ACGCCGCCCT GCGCGACATG
CTGGCCGCCT ACGCCTTCAA GGGTCCGCCC TATCCGACCA CCCTGGAACT GGTCGCGGCC
CTGCGCCGCC ACGCCCCCGC CGACAAGCAG GCCCTGATCA CCGACCTGTT CGAGAAGATC
ACGCTGTACG ACCTGAAGAC CACGGCGGCG ACGGTGAAGA AGCGCCCCGA TGGCCGGTTC
GACGTCACCC TGACGGTGAC GGCCAAGAAG CTCTACGCCG AGGGCCGCGG CCAGGAGAAG
GAAGCGCCGA TGAGCGAGCC GATGGACATC GGCCTGTTCA CGCTCGAGCC GGGCAAGAAG
GGTTTCGGGG CCGACAAGGT CGTGGCCTTC GAGCGCCGGA CCATCACGTC GGGGACCCAG
ACCCTGGGCT TCGTCACGTC GGTCGCGCCG AAGGCGGCGG GGGTGGATCC CTACAACATG
GTCATTGACC GCAATGGTGA CGACAACATC ACCAAGGTGG AGATGAGATG A
 
Protein sequence
MRMGMGRARM FGKVAAFEFR YQTRQPVFWV GVIIFFLLSF ASVAAPEFVQ FGSTANVHKN 
AASVIASGNL NLALIYMFIT AAFVANIIVR DEDTGFDGIL RSTPLTKFDY LYGRFTGAFA
AAALSFLAVP AGMALGAVMW WVDPESVGPF VLNHYLFAYF VLALPSLLLT SAVFFAATTV
TRSMMWTYVG VIAFMVASFI SSTLLRQPGL EKVAALWEPF GGGAYGLAIR YWTAAERNTQ
IPPLAGYLLA NRLIWIGIAL AIITAAYWLF DLRKGDGATR VKREKPGQAE TPRPTGVTHG
APRFDAASRW AQLWARARLD ARQVFGSPAY LVLVGLAAAL SVFNLWRVAD DSLYGGAIYP
VTRAMIQALN GIFPFISLII ASFYAGELVW REQDRKTHEL IDVTPIPDWA FVVPKTLAIS
LVLLSTFLAG VAVSVIIQLI KGYTHLELVN YIVWWVLPQA ADCILMAVLA VFIQVLSPHK
FVGWGLMVLY IIALIVASSW GLEHNLYLYD GSPMVPISDF NGQSRFWIGA WWFRLYWTAF
AVVLLVLCHV LWRRGTESRL RPRLAALPRR LMGGPGVIAG VALVAFIGVG AFIYINTNVW
NAYRTTLSNE RWAADYEKAL LPFETTPQPK IVAVKLDVDI RPGAPRIDTK GVYEIENRTD
KPLREIHVRF DRDLKVLALS IEGARPKRTF ERFNYRIFAF DTPMQPGERR SMSFSTARVA
MGFRNSGADT LMVDNGTFIN DGQLAPSLGM DRNGLLTDRS KRRKYGLKPE LRPAKLGDVA
SRQFNGLRRD SDWVTSDITV TTDADQTPMA PGYKVSDTTW HESANMSGPV NKAGDRRTAR
FVTEAPILHF FSIQSARYAL RTELYKGVRL SVYYHPAHAW NVERMISSMK RSLDYDQANF
SPYQFRQLRY LEFPAYGNFA QSFANTIPWS ENLGFVSKYE DPTKIDMVTY VGAHEIAHQW
WAHQLIGADQ QGGAALAETL AQYSALMVMK KIYGEPMIRK FLKYELDRYL RARGGEVIEE
LPLRQVEDQP YIYYNKGSLV MYRLASEIGE DNVNAALRDM LAAYAFKGPP YPTTLELVAA
LRRHAPADKQ ALITDLFEKI TLYDLKTTAA TVKKRPDGRF DVTLTVTAKK LYAEGRGQEK
EAPMSEPMDI GLFTLEPGKK GFGADKVVAF ERRTITSGTQ TLGFVTSVAP KAAGVDPYNM
VIDRNGDDNI TKVEMR