Gene Caul_3785 details


Gene Information

Locus tag: Caul_3785
Symbol:
ID: 5901247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4104732
End bp: 4105829
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 67%
IMG OID: 641564308
Product: oxidoreductase domain-containing protein
Protein accession: YP_001685410
Protein GI: 167647747
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
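
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon (the record states neither explicitly; the variable names are illustrative only):

```python
# Values copied from the Gene Information record.
start_bp = 4104732
end_bp = 4105829

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes the terminal stop codon, which encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1098
print(protein_length)   # 365
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.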


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000208707
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACATCG GACGCCGCGA CCTCCTTCTC GCCGCCGCTT CTCTCGCCGT CGCCTCAGCC 
GCGGGAGGGG CCCGCGCCGC CAGCGACCGC AAGATCGGCT ACGCGATCGT CGGCCTGGGC
TATTACGGCC TCAACGTCAT CCTGCCGCAG TTCGTCAACT GCGAGCACAG CCGGGTCACG
GCCCTGGTCA GCGGCGACCC GGCCAAGGCT CGCGCGACCG CGGCGCGCTA CGGAGTGCCG
GAGCGCTCGA TCTATTCCTA CGAGACCTTC GACCAGATCC GCGACAATCC CGACGTCGAT
GTCGTCTATG TCATCCTGCC CAATTCCATG CACGCCGAGT ACACGATCCG CGCCGCCAAG
GCCGGCAAGC ATGTGATGTG CGAGAAGCCG ATGGCGACGT CGGTCGCCGA GTGCGAGGCG
ATGATCGCCG CCTGCAAGAC GGCTGGGCGC AAGCTGATGA TCGGCTATCG CTGCCATTTC
GAGGCCACCA ATCTCGAAGC CGTGCGCCTG GCCCGCGCGG GCGCGGCCGG CCACGTCCGC
TATGTGCGCT CCGAGCACGG CTTCGTGCAG GGCGACCCGT CGAAGTGGCG GTTGAAGAAG
GCGATGGCGG GCGGCGGGTC GCTGATGGAC ATGGGCGTCT ACAGCCTGCA GGCGGCGCGA
TACATGACCG GCGAGGAGCC CGTCTCGGTC ACGGCGCGCG AGTCCACCGA TCGCGGCGAT
CCCCGGTTCA CCGAGGTGGA GGACATGATC GAATGGCTGC TGAAGTTTCC GTCCGGGGCG
ATCGCCAGCT GCCTGTCGAC CTACAGCGCA AATCAGAACC ACGTCCTGCT GATGGGCGAC
AAGGGACGGA TCGAGATGGA GCCCGCCACC CGCTATGACG GCAATCGCCT GTGGACCGGC
AGGGACGGCC GCGCGGACGA AATCGCGCCG CCGCCTGGCC CGGCCAAGAC CCAGTTCGCC
GGCCAACTGG ATCACCTGGC CGATTGCATT CGAACCAACC GCACGCCGAT CGTTTCGGGA
GAAGAGGGCC TGCGCGACAT GCGGATCATC GAGGCGATCT ACCGGTCGGC GCGGGAGGAA
AGCACGATCA AGCTGTAG
 
Protein sequence
MDIGRRDLLL AAASLAVASA AGGARAASDR KIGYAIVGLG YYGLNVILPQ FVNCEHSRVT 
ALVSGDPAKA RATAARYGVP ERSIYSYETF DQIRDNPDVD VVYVILPNSM HAEYTIRAAK
AGKHVMCEKP MATSVAECEA MIAACKTAGR KLMIGYRCHF EATNLEAVRL ARAGAAGHVR
YVRSEHGFVQ GDPSKWRLKK AMAGGGSLMD MGVYSLQAAR YMTGEEPVSV TARESTDRGD
PRFTEVEDMI EWLLKFPSGA IASCLSTYSA NQNHVLLMGD KGRIEMEPAT RYDGNRLWTG
RDGRADEIAP PPGPAKTQFA GQLDHLADCI RTNRTPIVSG EEGLRDMRII EAIYRSAREE
STIKL
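
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in Python, not a reference implementation. It assumes the space-separated 10-base blocks are purely a display convention. It also relies on the fact that translation table 11 uses the same codon assignments as the standard code and differs only in which start codons it permits; since this gene starts with ATG, no special start handling is needed. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final line of the gene sequence shown above.
print(translate("ATGGACATCG GACGCCGCGA CCTCCTTCTC"))  # MDIGRRDLLL
print(translate("AGCACGATCA AGCTGTAG"))               # STIKL
```

Each full sequence line holds 60 bases, a multiple of three, so every line begins in frame and can be translated on its own. Passing the complete gene sequence to `gc_fraction` and `translate` would allow a comparison against the "GC content" field and the protein sequence in the record.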