Gene Caul_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3894 
Symbol 
ID5901356 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4214649 
End bp4215989 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content71% 
IMG OID641564415 
Producthypothetical protein 
Protein accessionYP_001685517 
Protein GI167647854 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.213149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.698237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGGGG TCGCCGCCAA CAAGTTGCTG ATGGTTCGGA AGCTGGTGGA AACCGCGCCC 
GACGCCGCCT TGCGTAGCCT CGAGCTCGCC CTGTCCGGCC CGGCGGGCGG GCAGGGCGCG
CTGGCGGCCG TGCGCGGGTT GGTCGAGGAC GAAACCGCCG CCCGTTACGT CCGCAACAGC
GTGCTGGCCC CGATCGCGCC CCTGTGCGTC AAGCGCGACA CCGAGCAGAC CTCCTTCCCG
CCCCGCACCC TGGCCCTGCT GTGGGCCGCC CTGAGGGCCG AAGCCCCCAA GCAGGTCGAA
GAGGCCGCCG CCCGCTGCAA TCCGTGGGAC CTGGAGCAGG GTCCGCCGGA CGTCTTCGAC
GAACTTTGCA AGATCGCCGC CAAGGGCCTG CGGGCCCAGG CCGCCCCCGG CTTCCTGGCC
CTCGACGCGA TCTGCGACAT CGACGAGCTG GCGTCGTGCC TGGAGCTGTC GCACATCGTC
CGCGCGGCCC TGCCAAAGCT GTCGGAATGG GTCAGCCGGA TGAGTGACGA GCGTGCCTCC
GCCGCTCGCC TGGCCTACAA GGACGCCTGC ACCATCCGCC CGGACGCCGG GCCTCTGCTG
TTCGAGATGA TGGCGGCGCA CCTGCCCGAC GACTGGCGGA TCCTGCGCGT GATCTCGGCG
GTCATGGACC GGCCCGGCGA CAGGTTCTGG GCCTCTTCGG AGGTCAGCGT GTTCGGCGAA
CGGGTGCTGG CCGACATCGA GAAGAACATC GACTACATCC AGGGCTTCGA CGCGGACAAG
GGCGAGGCCG AGGGGCGCAA GGCCGCGCTC GCCGCCCAGA AGGTCTCGCA GGAGATCACC
GAGTTCGAGC AGTCGGTGAA CCTGGCCAAG GACGGCCCGT GGGGCCGGCG GATTTCCAAG
CACAAGCAGG GCGTCGCCCA GGCCGTCGAG AGCCGGATGA ACAAGGCCGA GAACGAGCTG
CTGGCGGCCT TGCCGCTGCG GCCGATCTCG ATCCTCGGCG GCAAGAAGGG CAAGGGCGTT
CCACAACTGG TCGTCGAACC GGATCCGGCG GCGCACCGCC GCGCGACCGC CGCCCTGGCC
TTCATCGCCG ACGTGCGCAG TTGCGCCATG CAGAGCGGCT ACGGCGCCAG CCGCGCCAAG
GCCCTGGAAA AGATCAACAG CCGCCTGGAC CAGTATATCG AGGACATCCT GCACGTGGTC
CGCACCGGCG ACGGCGGCGA CCCGGTCCTG GCCCGGCTCT ATGTCGACAT GGCCGCCGGC
TACATCGCCT TCAGCCGCGA CGAGAAGACC GCCGAGATCG TCCGCCGCCG CGCCGCCGCG
GCGATGGCGG CCGCGGCCTA G
 
Protein sequence
MAGVAANKLL MVRKLVETAP DAALRSLELA LSGPAGGQGA LAAVRGLVED ETAARYVRNS 
VLAPIAPLCV KRDTEQTSFP PRTLALLWAA LRAEAPKQVE EAAARCNPWD LEQGPPDVFD
ELCKIAAKGL RAQAAPGFLA LDAICDIDEL ASCLELSHIV RAALPKLSEW VSRMSDERAS
AARLAYKDAC TIRPDAGPLL FEMMAAHLPD DWRILRVISA VMDRPGDRFW ASSEVSVFGE
RVLADIEKNI DYIQGFDADK GEAEGRKAAL AAQKVSQEIT EFEQSVNLAK DGPWGRRISK
HKQGVAQAVE SRMNKAENEL LAALPLRPIS ILGGKKGKGV PQLVVEPDPA AHRRATAALA
FIADVRSCAM QSGYGASRAK ALEKINSRLD QYIEDILHVV RTGDGGDPVL ARLYVDMAAG
YIAFSRDEKT AEIVRRRAAA AMAAAA